Overview
Brought to you by YData
Dataset statistics
| Number of variables | 88 |
|---|---|
| Number of observations | 290898 |
| Missing cells | 6184922 |
| Missing cells (%) | 24.2% |
| Total size in memory | 195.3 MiB |
| Average record size in memory | 704.0 B |
Variable types
| Text | 88 |
|---|
Dataset
| Description | Naturalis Biodiversity Center (NL) - Aves 0061686-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.u5tv27 |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "Naturalis Biodiversity Center" | Constant |
rightsHolder has constant value "Naturalis Biodiversity Center" | Constant |
institutionID has constant value "https://ror.org/0566bfb96" | Constant |
collectionCode has constant value "Aves" | Constant |
basisOfRecord has constant value "PRESERVED_SPECIMEN" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
associatedTaxa has constant value "has parasite: Cirrophthirius cf. recurvirostrae | Quadraceps sp." | Constant |
nomenclaturalCode has constant value "ICZN" | Constant |
datasetKey has constant value "889c91a3-614f-4355-8df8-b6d0260a118c" | Constant |
publishingCountry has constant value "NL" | Constant |
protocol has constant value "DWC_ARCHIVE" | Constant |
lastCrawled has constant value "2025-01-03T11:34:30.428Z" | Constant |
isSequenced has constant value "false" | Constant |
publishedByGbifRegion has constant value "EUROPE" | Constant |
recordNumber has 277608 (95.4%) missing values | Missing |
recordedBy has 93217 (32.0%) missing values | Missing |
individualCount has 30538 (10.5%) missing values | Missing |
sex has 98571 (33.9%) missing values | Missing |
lifeStage has 210308 (72.3%) missing values | Missing |
associatedTaxa has 290895 (> 99.9%) missing values | Missing |
eventDate has 74430 (25.6%) missing values | Missing |
startDayOfYear has 74430 (25.6%) missing values | Missing |
endDayOfYear has 74430 (25.6%) missing values | Missing |
year has 78830 (27.1%) missing values | Missing |
month has 87276 (30.0%) missing values | Missing |
day has 101613 (34.9%) missing values | Missing |
verbatimEventDate has 59902 (20.6%) missing values | Missing |
continent has 94391 (32.4%) missing values | Missing |
island has 200600 (69.0%) missing values | Missing |
countryCode has 47203 (16.2%) missing values | Missing |
stateProvince has 137182 (47.2%) missing values | Missing |
locality has 79647 (27.4%) missing values | Missing |
verbatimElevation has 288311 (99.1%) missing values | Missing |
decimalLatitude has 139112 (47.8%) missing values | Missing |
decimalLongitude has 139112 (47.8%) missing values | Missing |
coordinateUncertaintyInMeters has 289239 (99.4%) missing values | Missing |
typeStatus has 287427 (98.8%) missing values | Missing |
identifiedBy has 290486 (99.9%) missing values | Missing |
dateIdentified has 290641 (99.9%) missing values | Missing |
specificEpithet has 10799 (3.7%) missing values | Missing |
infraspecificEpithet has 125699 (43.2%) missing values | Missing |
distanceFromCentroidInMeters has 289238 (99.4%) missing values | Missing |
mediaType has 207500 (71.3%) missing values | Missing |
speciesKey has 10568 (3.6%) missing values | Missing |
species has 10568 (3.6%) missing values | Missing |
repatriated has 46939 (16.1%) missing values | Missing |
gbifRegion has 50475 (17.4%) missing values | Missing |
level0Gid has 158562 (54.5%) missing values | Missing |
level0Name has 158562 (54.5%) missing values | Missing |
level1Gid has 159606 (54.9%) missing values | Missing |
level1Name has 159606 (54.9%) missing values | Missing |
level2Gid has 161386 (55.5%) missing values | Missing |
level2Name has 161392 (55.5%) missing values | Missing |
level3Gid has 227914 (78.3%) missing values | Missing |
level3Name has 229332 (78.8%) missing values | Missing |
iucnRedListCategory has 167789 (57.7%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 23:40:02.757842 |
|---|---|
| Analysis finished | 2025-01-08 23:40:11.593245 |
| Duration | 8.84 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 290898 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 290898 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2434047501 |
|---|---|
| 2nd row | 2434047502 |
| 3rd row | 2434047503 |
| 4th row | 2434047504 |
| 5th row | 2434047505 |
| Value | Count | Frequency (%) |
| 2434047501 | 1 | < 0.1% |
| 2433858690 | 1 | < 0.1% |
| 2434047505 | 1 | < 0.1% |
| 2434047506 | 1 | < 0.1% |
| 2434047507 | 1 | < 0.1% |
| 2434047508 | 1 | < 0.1% |
| 2434047523 | 1 | < 0.1% |
| 2434047509 | 1 | < 0.1% |
| 2433858683 | 1 | < 0.1% |
| 2434047503 | 1 | < 0.1% |
| Other values (290888) | 290888 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 648315 | |
| 3 | 508615 | |
| 2 | 477380 | |
| 1 | 244327 | 8.4% |
| 0 | 214623 | 7.4% |
| 9 | 195309 | 6.7% |
| 8 | 174998 | 6.0% |
| 7 | 151616 | 5.2% |
| 5 | 148917 | 5.1% |
| 6 | 144880 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2908980 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 648315 | |
| 3 | 508615 | |
| 2 | 477380 | |
| 1 | 244327 | 8.4% |
| 0 | 214623 | 7.4% |
| 9 | 195309 | 6.7% |
| 8 | 174998 | 6.0% |
| 7 | 151616 | 5.2% |
| 5 | 148917 | 5.1% |
| 6 | 144880 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2908980 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 648315 | |
| 3 | 508615 | |
| 2 | 477380 | |
| 1 | 244327 | 8.4% |
| 0 | 214623 | 7.4% |
| 9 | 195309 | 6.7% |
| 8 | 174998 | 6.0% |
| 7 | 151616 | 5.2% |
| 5 | 148917 | 5.1% |
| 6 | 144880 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2908980 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 648315 | |
| 3 | 508615 | |
| 2 | 477380 | |
| 1 | 244327 | 8.4% |
| 0 | 214623 | 7.4% |
| 9 | 195309 | 6.7% |
| 8 | 174998 | 6.0% |
| 7 | 151616 | 5.2% |
| 5 | 148917 | 5.1% |
| 6 | 144880 | 5.0% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 581796 | |
| 0 | 581796 | |
| _ | 581796 | |
| 1 | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 872694 | |
| Uppercase Letter | 581796 | |
| Connector Punctuation | 581796 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 581796 | |
| 1 | 290898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 581796 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1454490 | |
| Latin | 581796 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 581796 | |
| _ | 581796 | |
| 1 | 290898 |
Latin
| Value | Count | Frequency (%) |
| C | 581796 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2036286 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 581796 | |
| 0 | 581796 | |
| _ | 581796 | |
| 1 | 290898 |
modified
Text
| Distinct | 1170 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 229 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2015-06-05T00:00:00Z |
|---|---|
| 2nd row | 2023-05-16T00:00:00Z |
| 3rd row | 2015-09-02T00:00:00Z |
| 4th row | 2017-07-01T00:00:00Z |
| 5th row | 2015-05-23T00:00:00Z |
| Value | Count | Frequency (%) |
| 2017-06-30t00:00:00z | 48811 | |
| 2023-05-16t00:00:00z | 41000 | |
| 2017-07-01t00:00:00z | 26280 | 9.0% |
| 2015-05-23t00:00:00z | 17611 | 6.1% |
| 2015-07-03t00:00:00z | 13223 | 4.5% |
| 2015-05-18t00:00:00z | 11421 | 3.9% |
| 2015-07-01t00:00:00z | 10549 | 3.6% |
| 2015-06-24t00:00:00z | 9657 | 3.3% |
| 2015-07-02t00:00:00z | 9646 | 3.3% |
| 2015-06-23t00:00:00z | 9602 | 3.3% |
| Other values (1160) | 93098 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2479444 | |
| - | 581796 | 10.0% |
| : | 581796 | 10.0% |
| 2 | 488823 | 8.4% |
| 1 | 370421 | 6.4% |
| T | 290898 | 5.0% |
| Z | 290898 | 5.0% |
| 5 | 236072 | 4.1% |
| 3 | 147699 | 2.5% |
| 6 | 142899 | 2.5% |
| Other values (4) | 207214 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4072572 | |
| Dash Punctuation | 581796 | 10.0% |
| Other Punctuation | 581796 | 10.0% |
| Uppercase Letter | 581796 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2479444 | |
| 2 | 488823 | 12.0% |
| 1 | 370421 | 9.1% |
| 5 | 236072 | 5.8% |
| 3 | 147699 | 3.6% |
| 6 | 142899 | 3.5% |
| 7 | 140323 | 3.4% |
| 8 | 26588 | 0.7% |
| 9 | 21411 | 0.5% |
| 4 | 18892 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 581796 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5236164 | |
| Latin | 581796 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2479444 | |
| - | 581796 | 11.1% |
| : | 581796 | 11.1% |
| 2 | 488823 | 9.3% |
| 1 | 370421 | 7.1% |
| 5 | 236072 | 4.5% |
| 3 | 147699 | 2.8% |
| 6 | 142899 | 2.7% |
| 7 | 140323 | 2.7% |
| 8 | 26588 | 0.5% |
| Other values (2) | 40303 | 0.8% |
Latin
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5817960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2479444 | |
| - | 581796 | 10.0% |
| : | 581796 | 10.0% |
| 2 | 488823 | 8.4% |
| 1 | 370421 | 6.4% |
| T | 290898 | 5.0% |
| Z | 290898 | 5.0% |
| 5 | 236072 | 4.1% |
| 3 | 147699 | 2.5% |
| 6 | 142899 | 2.5% |
| Other values (4) | 207214 | 3.6% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Naturalis Biodiversity Center |
|---|---|
| 2nd row | Naturalis Biodiversity Center |
| 3rd row | Naturalis Biodiversity Center |
| 4th row | Naturalis Biodiversity Center |
| 5th row | Naturalis Biodiversity Center |
| Value | Count | Frequency (%) |
| naturalis | 290898 | |
| biodiversity | 290898 | |
| center | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| 581796 | 6.9% | |
| s | 581796 | 6.9% |
| a | 581796 | 6.9% |
| d | 290898 | 3.4% |
| C | 290898 | 3.4% |
| y | 290898 | 3.4% |
| Other values (7) | 2036286 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6981552 | |
| Uppercase Letter | 872694 | 10.3% |
| Space Separator | 581796 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| s | 581796 | |
| a | 581796 | |
| d | 290898 | 4.2% |
| y | 290898 | 4.2% |
| v | 290898 | 4.2% |
| o | 290898 | 4.2% |
| Other values (3) | 872694 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 290898 | |
| N | 290898 | |
| B | 290898 |
Space Separator
| Value | Count | Frequency (%) |
| 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7854246 | |
| Common | 581796 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| s | 581796 | 7.4% |
| a | 581796 | 7.4% |
| d | 290898 | 3.7% |
| C | 290898 | 3.7% |
| y | 290898 | 3.7% |
| v | 290898 | 3.7% |
| Other values (6) | 1745388 |
Common
| Value | Count | Frequency (%) |
| 581796 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8436042 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| 581796 | 6.9% | |
| s | 581796 | 6.9% |
| a | 581796 | 6.9% |
| d | 290898 | 3.4% |
| C | 290898 | 3.4% |
| y | 290898 | 3.4% |
| Other values (7) | 2036286 |
rightsHolder
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Naturalis Biodiversity Center |
|---|---|
| 2nd row | Naturalis Biodiversity Center |
| 3rd row | Naturalis Biodiversity Center |
| 4th row | Naturalis Biodiversity Center |
| 5th row | Naturalis Biodiversity Center |
| Value | Count | Frequency (%) |
| naturalis | 290898 | |
| biodiversity | 290898 | |
| center | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| 581796 | 6.9% | |
| s | 581796 | 6.9% |
| a | 581796 | 6.9% |
| d | 290898 | 3.4% |
| C | 290898 | 3.4% |
| y | 290898 | 3.4% |
| Other values (7) | 2036286 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6981552 | |
| Uppercase Letter | 872694 | 10.3% |
| Space Separator | 581796 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| s | 581796 | |
| a | 581796 | |
| d | 290898 | 4.2% |
| y | 290898 | 4.2% |
| v | 290898 | 4.2% |
| o | 290898 | 4.2% |
| Other values (3) | 872694 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 290898 | |
| N | 290898 | |
| B | 290898 |
Space Separator
| Value | Count | Frequency (%) |
| 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7854246 | |
| Common | 581796 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| s | 581796 | 7.4% |
| a | 581796 | 7.4% |
| d | 290898 | 3.7% |
| C | 290898 | 3.7% |
| y | 290898 | 3.7% |
| v | 290898 | 3.7% |
| Other values (6) | 1745388 |
Common
| Value | Count | Frequency (%) |
| 581796 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8436042 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1163592 | |
| t | 872694 | |
| r | 872694 | |
| e | 872694 | |
| 581796 | 6.9% | |
| s | 581796 | 6.9% |
| a | 581796 | 6.9% |
| d | 290898 | 3.4% |
| C | 290898 | 3.4% |
| y | 290898 | 3.4% |
| Other values (7) | 2036286 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 25 |
| Mean length | 25 |
| Min length | 25 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | https://ror.org/0566bfb96 |
|---|---|
| 2nd row | https://ror.org/0566bfb96 |
| 3rd row | https://ror.org/0566bfb96 |
| 4th row | https://ror.org/0566bfb96 |
| 5th row | https://ror.org/0566bfb96 |
| Value | Count | Frequency (%) |
| https://ror.org/0566bfb96 | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 872694 | |
| r | 872694 | |
| 6 | 872694 | |
| t | 581796 | 8.0% |
| o | 581796 | 8.0% |
| b | 581796 | 8.0% |
| h | 290898 | 4.0% |
| p | 290898 | 4.0% |
| s | 290898 | 4.0% |
| : | 290898 | 4.0% |
| Other values (6) | 1745388 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4072572 | |
| Decimal Number | 1745388 | |
| Other Punctuation | 1454490 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 872694 | |
| t | 581796 | |
| o | 581796 | |
| b | 581796 | |
| h | 290898 | 7.1% |
| p | 290898 | 7.1% |
| s | 290898 | 7.1% |
| g | 290898 | 7.1% |
| f | 290898 | 7.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 872694 | |
| 0 | 290898 | 16.7% |
| 5 | 290898 | 16.7% |
| 9 | 290898 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 872694 | |
| : | 290898 | 20.0% |
| . | 290898 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4072572 | |
| Common | 3199878 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 872694 | |
| t | 581796 | |
| o | 581796 | |
| b | 581796 | |
| h | 290898 | 7.1% |
| p | 290898 | 7.1% |
| s | 290898 | 7.1% |
| g | 290898 | 7.1% |
| f | 290898 | 7.1% |
Common
| Value | Count | Frequency (%) |
| / | 872694 | |
| 6 | 872694 | |
| : | 290898 | 9.1% |
| . | 290898 | 9.1% |
| 0 | 290898 | 9.1% |
| 5 | 290898 | 9.1% |
| 9 | 290898 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7272450 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 872694 | |
| r | 872694 | |
| 6 | 872694 | |
| t | 581796 | 8.0% |
| o | 581796 | 8.0% |
| b | 581796 | 8.0% |
| h | 290898 | 4.0% |
| p | 290898 | 4.0% |
| s | 290898 | 4.0% |
| : | 290898 | 4.0% |
| Other values (6) | 1745388 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aves |
|---|---|
| 2nd row | Aves |
| 3rd row | Aves |
| 4th row | Aves |
| 5th row | Aves |
| Value | Count | Frequency (%) |
| aves | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 290898 | |
| v | 290898 | |
| e | 290898 | |
| s | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 872694 | |
| Uppercase Letter | 290898 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| v | 290898 | |
| e | 290898 | |
| s | 290898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1163592 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 290898 | |
| v | 290898 | |
| e | 290898 | |
| s | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1163592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 290898 | |
| v | 290898 | |
| e | 290898 | |
| s | 290898 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1454490 | |
| P | 581796 | 11.1% |
| R | 581796 | 11.1% |
| S | 581796 | 11.1% |
| V | 290898 | 5.6% |
| D | 290898 | 5.6% |
| _ | 290898 | 5.6% |
| C | 290898 | 5.6% |
| I | 290898 | 5.6% |
| M | 290898 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4945266 | |
| Connector Punctuation | 290898 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1454490 | |
| P | 581796 | 11.8% |
| R | 581796 | 11.8% |
| S | 581796 | 11.8% |
| V | 290898 | 5.9% |
| D | 290898 | 5.9% |
| C | 290898 | 5.9% |
| I | 290898 | 5.9% |
| M | 290898 | 5.9% |
| N | 290898 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4945266 | |
| Common | 290898 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1454490 | |
| P | 581796 | 11.8% |
| R | 581796 | 11.8% |
| S | 581796 | 11.8% |
| V | 290898 | 5.9% |
| D | 290898 | 5.9% |
| C | 290898 | 5.9% |
| I | 290898 | 5.9% |
| M | 290898 | 5.9% |
| N | 290898 | 5.9% |
Common
| Value | Count | Frequency (%) |
| _ | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5236164 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1454490 | |
| P | 581796 | 11.1% |
| R | 581796 | 11.1% |
| S | 581796 | 11.1% |
| V | 290898 | 5.6% |
| D | 290898 | 5.6% |
| _ | 290898 | 5.6% |
| C | 290898 | 5.6% |
| I | 290898 | 5.6% |
| M | 290898 | 5.6% |
occurrenceID
Text
Unique 
| Distinct | 290898 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 77 |
|---|---|
| Median length | 71 |
| Mean length | 67.20245241 |
| Min length | 62 |
Unique
| Unique | 290898 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | https://data.biodiversitydata.nl/naturalis/specimen/ZMA.AVES.2 |
|---|---|
| 2nd row | https://data.biodiversitydata.nl/naturalis/specimen/RMNH.AVES.4 |
| 3rd row | https://data.biodiversitydata.nl/naturalis/specimen/ZMA.AVES.18 |
| 4th row | https://data.biodiversitydata.nl/naturalis/specimen/ZMA.AVES.27 |
| 5th row | https://data.biodiversitydata.nl/naturalis/specimen/ZMA.AVES.36 |
| Value | Count | Frequency (%) |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.2 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/rmnh.5069558 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.36 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.45 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.54 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.72 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.222 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.81 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/rmnh.5069738 | 1 | < 0.1% |
| https://data.biodiversitydata.nl/naturalis/specimen/zma.aves.18 | 1 | < 0.1% |
| Other values (290888) | 290888 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1748359 | 8.9% |
| t | 1745388 | 8.9% |
| / | 1454490 | 7.4% |
| i | 1454490 | 7.4% |
| . | 1171254 | 6.0% |
| s | 1163592 | 6.0% |
| d | 872773 | 4.5% |
| e | 872704 | 4.5% |
| n | 872694 | 4.5% |
| l | 581796 | 3.0% |
| Other values (34) | 7611519 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12806742 | |
| Other Punctuation | 2916642 | 14.9% |
| Uppercase Letter | 2257505 | 11.5% |
| Decimal Number | 1568170 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1748359 | |
| t | 1745388 | |
| i | 1454490 | |
| s | 1163592 | |
| d | 872773 | 6.8% |
| e | 872704 | 6.8% |
| n | 872694 | 6.8% |
| l | 581796 | 4.5% |
| p | 581796 | 4.5% |
| r | 581796 | 4.5% |
| Other values (9) | 2331354 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 353920 | |
| M | 290897 | |
| E | 289127 | |
| S | 289126 | |
| V | 289126 | |
| R | 226103 | |
| N | 226103 | |
| H | 226103 | |
| Z | 64794 | 2.9% |
| P | 2204 | 0.1% |
| Other values (2) | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 229669 | |
| 2 | 204495 | |
| 5 | 154944 | |
| 3 | 152835 | |
| 4 | 146877 | |
| 6 | 140057 | |
| 0 | 137531 | |
| 7 | 135756 | |
| 8 | 134201 | |
| 9 | 131805 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1454490 | |
| . | 1171254 | |
| : | 290898 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15064247 | |
| Common | 4484812 | 22.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1748359 | 11.6% |
| t | 1745388 | 11.6% |
| i | 1454490 | 9.7% |
| s | 1163592 | 7.7% |
| d | 872773 | 5.8% |
| e | 872704 | 5.8% |
| n | 872694 | 5.8% |
| l | 581796 | 3.9% |
| p | 581796 | 3.9% |
| r | 581796 | 3.9% |
| Other values (21) | 4588859 |
Common
| Value | Count | Frequency (%) |
| / | 1454490 | |
| . | 1171254 | |
| : | 290898 | 6.5% |
| 1 | 229669 | 5.1% |
| 2 | 204495 | 4.6% |
| 5 | 154944 | 3.5% |
| 3 | 152835 | 3.4% |
| 4 | 146877 | 3.3% |
| 6 | 140057 | 3.1% |
| 0 | 137531 | 3.1% |
| Other values (3) | 401762 | 9.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19549059 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1748359 | 8.9% |
| t | 1745388 | 8.9% |
| / | 1454490 | 7.4% |
| i | 1454490 | 7.4% |
| . | 1171254 | 6.0% |
| s | 1163592 | 6.0% |
| d | 872773 | 4.5% |
| e | 872704 | 4.5% |
| n | 872694 | 4.5% |
| l | 581796 | 3.0% |
| Other values (34) | 7611519 |
catalogNumber
Text
Unique 
| Distinct | 290898 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 19 |
| Mean length | 15.20245241 |
| Min length | 10 |
Unique
| Unique | 290898 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | ZMA.AVES.2 |
|---|---|
| 2nd row | RMNH.AVES.4 |
| 3rd row | ZMA.AVES.18 |
| 4th row | ZMA.AVES.27 |
| 5th row | ZMA.AVES.36 |
| Value | Count | Frequency (%) |
| zma.aves.2 | 1 | < 0.1% |
| rmnh.5069558 | 1 | < 0.1% |
| zma.aves.36 | 1 | < 0.1% |
| zma.aves.45 | 1 | < 0.1% |
| zma.aves.54 | 1 | < 0.1% |
| zma.aves.72 | 1 | < 0.1% |
| zma.aves.222 | 1 | < 0.1% |
| zma.aves.81 | 1 | < 0.1% |
| rmnh.5069738 | 1 | < 0.1% |
| zma.aves.18 | 1 | < 0.1% |
| Other values (290888) | 290888 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 589458 | |
| A | 353920 | 8.0% |
| M | 290897 | 6.6% |
| E | 289127 | 6.5% |
| V | 289126 | 6.5% |
| S | 289126 | 6.5% |
| 1 | 229669 | 5.2% |
| N | 226103 | 5.1% |
| R | 226103 | 5.1% |
| H | 226103 | 5.1% |
| Other values (21) | 1412731 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2257505 | |
| Decimal Number | 1568170 | |
| Other Punctuation | 589458 | 13.3% |
| Lowercase Letter | 7230 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 353920 | |
| M | 290897 | |
| E | 289127 | |
| V | 289126 | |
| S | 289126 | |
| N | 226103 | |
| R | 226103 | |
| H | 226103 | |
| Z | 64794 | 2.9% |
| P | 2204 | 0.1% |
| Other values (2) | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 229669 | |
| 2 | 204495 | |
| 5 | 154944 | |
| 3 | 152835 | |
| 4 | 146877 | |
| 6 | 140057 | |
| 0 | 137531 | |
| 7 | 135756 | |
| 8 | 134201 | |
| 9 | 131805 |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 2993 | |
| a | 2971 | |
| c | 1060 | 14.7% |
| x | 106 | 1.5% |
| d | 79 | 1.1% |
| e | 10 | 0.1% |
| y | 10 | 0.1% |
| v | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 589458 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2264735 | |
| Common | 2157628 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 353920 | |
| M | 290897 | |
| E | 289127 | |
| V | 289126 | |
| S | 289126 | |
| N | 226103 | |
| R | 226103 | |
| H | 226103 | |
| Z | 64794 | 2.9% |
| b | 2993 | 0.1% |
| Other values (10) | 6443 | 0.3% |
Common
| Value | Count | Frequency (%) |
| . | 589458 | |
| 1 | 229669 | 10.6% |
| 2 | 204495 | 9.5% |
| 5 | 154944 | 7.2% |
| 3 | 152835 | 7.1% |
| 4 | 146877 | 6.8% |
| 6 | 140057 | 6.5% |
| 0 | 137531 | 6.4% |
| 7 | 135756 | 6.3% |
| 8 | 134201 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4422363 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 589458 | |
| A | 353920 | 8.0% |
| M | 290897 | 6.6% |
| E | 289127 | 6.5% |
| V | 289126 | 6.5% |
| S | 289126 | 6.5% |
| 1 | 229669 | 5.2% |
| N | 226103 | 5.1% |
| R | 226103 | 5.1% |
| H | 226103 | 5.1% |
| Other values (21) | 1412731 |
recordNumber
Text
Missing 
| Distinct | 5837 |
|---|---|
| Distinct (%) | 43.9% |
| Missing | 277608 |
| Missing (%) | 95.4% |
| Memory size | 2.2 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 4.631226486 |
| Min length | 1 |
Unique
| Unique | 4106 ? |
|---|---|
| Unique (%) | 30.9% |
Sample
| 1st row | 1.3 |
|---|---|
| 2nd row | 4.3 |
| 3rd row | 6.4 |
| 4th row | 15 |
| 5th row | 175 |
| Value | Count | Frequency (%) |
| no | 3016 | 17.2% |
| reg | 601 | 3.4% |
| reg.no | 175 | 1.0% |
| n | 85 | 0.5% |
| verz | 57 | 0.3% |
| coll.-no | 49 | 0.3% |
| 2 | 47 | 0.3% |
| 3 | 41 | 0.2% |
| 1 | 41 | 0.2% |
| 6 | 34 | 0.2% |
| Other values (4160) | 13389 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7134 | |
| 4 | 4703 | 7.6% |
| 3 | 4671 | 7.6% |
| 2 | 4607 | 7.5% |
| 4247 | 6.9% | |
| . | 4085 | 6.6% |
| 5 | 3931 | 6.4% |
| 6 | 3619 | 5.9% |
| 7 | 3512 | 5.7% |
| o | 3431 | 5.6% |
| Other values (63) | 17609 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 41965 | |
| Lowercase Letter | 6638 | 10.8% |
| Other Punctuation | 4273 | 6.9% |
| Space Separator | 4247 | 6.9% |
| Uppercase Letter | 4115 | 6.7% |
| Close Punctuation | 103 | 0.2% |
| Open Punctuation | 103 | 0.2% |
| Dash Punctuation | 81 | 0.1% |
| Math Symbol | 24 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3431 | |
| e | 965 | 14.5% |
| g | 815 | 12.3% |
| n | 422 | 6.4% |
| r | 223 | 3.4% |
| l | 215 | 3.2% |
| v | 79 | 1.2% |
| a | 76 | 1.1% |
| z | 73 | 1.1% |
| c | 65 | 1.0% |
| Other values (14) | 274 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3030 | |
| R | 739 | 18.0% |
| C | 140 | 3.4% |
| I | 65 | 1.6% |
| X | 32 | 0.8% |
| V | 16 | 0.4% |
| A | 15 | 0.4% |
| G | 13 | 0.3% |
| B | 12 | 0.3% |
| L | 12 | 0.3% |
| Other values (13) | 41 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7134 | |
| 4 | 4703 | |
| 3 | 4671 | |
| 2 | 4607 | |
| 5 | 3931 | |
| 6 | 3619 | |
| 7 | 3512 | |
| 8 | 3321 | |
| 0 | 3240 | |
| 9 | 3227 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4085 | |
| : | 105 | 2.5% |
| ' | 30 | 0.7% |
| , | 16 | 0.4% |
| / | 16 | 0.4% |
| ? | 15 | 0.4% |
| ; | 4 | 0.1% |
| & | 1 | < 0.1% |
| … | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 101 | |
| ] | 2 | 1.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 101 | |
| [ | 2 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 4247 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 81 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 50796 | |
| Latin | 10753 | 17.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3431 | |
| N | 3030 | |
| e | 965 | 9.0% |
| g | 815 | 7.6% |
| R | 739 | 6.9% |
| n | 422 | 3.9% |
| r | 223 | 2.1% |
| l | 215 | 2.0% |
| C | 140 | 1.3% |
| v | 79 | 0.7% |
| Other values (37) | 694 | 6.5% |
Common
| Value | Count | Frequency (%) |
| 1 | 7134 | |
| 4 | 4703 | |
| 3 | 4671 | |
| 2 | 4607 | |
| 4247 | ||
| . | 4085 | |
| 5 | 3931 | |
| 6 | 3619 | |
| 7 | 3512 | |
| 8 | 3321 | |
| Other values (16) | 6966 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61548 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7134 | |
| 4 | 4703 | 7.6% |
| 3 | 4671 | 7.6% |
| 2 | 4607 | 7.5% |
| 4247 | 6.9% | |
| . | 4085 | 6.6% |
| 5 | 3931 | 6.4% |
| 6 | 3619 | 5.9% |
| 7 | 3512 | 5.7% |
| o | 3431 | 5.6% |
| Other values (62) | 17608 |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 |
recordedBy
Text
Missing 
| Distinct | 11884 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 93217 |
| Missing (%) | 32.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 252 |
|---|---|
| Median length | 227 |
| Mean length | 15.05396573 |
| Min length | 2 |
Unique
| Unique | 6886 ? |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | Van der Spruyt G.S. |
|---|---|
| 2nd row | Groen J. |
| 3rd row | Pollen&vDam cf Apr'63-Jun'66 |
| 4th row | Ploos van Amstel D. |
| 5th row | Ebels E. |
| Value | Count | Frequency (%) |
| van | 28494 | 5.3% |
| not | 14646 | 2.7% |
| stated | 13574 | 2.5% |
| 12973 | 2.4% | |
| bartels | 11552 | 2.2% |
| j | 10799 | 2.0% |
| de | 10438 | 1.9% |
| heurn | 8706 | 1.6% |
| m.e.g | 8361 | 1.6% |
| f | 7251 | 1.4% |
| Other values (8569) | 408661 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 347920 | 11.7% |
| 339595 | 11.4% | |
| e | 267162 | 9.0% |
| n | 167094 | 5.6% |
| a | 147430 | 5.0% |
| r | 142614 | 4.8% |
| o | 125459 | 4.2% |
| t | 117646 | 4.0% |
| s | 116659 | 3.9% |
| l | 83016 | 2.8% |
| Other values (91) | 1121288 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1637630 | |
| Uppercase Letter | 610104 | 20.5% |
| Other Punctuation | 377070 | 12.7% |
| Space Separator | 339595 | 11.4% |
| Decimal Number | 4048 | 0.1% |
| Open Punctuation | 2717 | 0.1% |
| Close Punctuation | 2714 | 0.1% |
| Dash Punctuation | 1952 | 0.1% |
| Math Symbol | 52 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 267162 | |
| n | 167094 | |
| a | 147430 | |
| r | 142614 | |
| o | 125459 | 7.7% |
| t | 117646 | 7.2% |
| s | 116659 | 7.1% |
| l | 83016 | 5.1% |
| i | 72781 | 4.4% |
| d | 62897 | 3.8% |
| Other values (34) | 334872 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 61957 | 10.2% |
| J | 50762 | 8.3% |
| B | 47748 | 7.8% |
| A | 40657 | 6.7% |
| M | 36414 | 6.0% |
| C | 35281 | 5.8% |
| G | 34607 | 5.7% |
| F | 31023 | 5.1% |
| P | 30114 | 4.9% |
| S | 27132 | 4.4% |
| Other values (17) | 214409 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 347920 | |
| & | 12702 | 3.4% |
| : | 6268 | 1.7% |
| ; | 5183 | 1.4% |
| / | 1659 | 0.4% |
| \ | 1596 | 0.4% |
| ' | 996 | 0.3% |
| ? | 377 | 0.1% |
| " | 294 | 0.1% |
| ! | 60 | < 0.1% |
| Other values (2) | 15 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1074 | |
| 9 | 747 | |
| 0 | 629 | |
| 6 | 456 | |
| 2 | 349 | 8.6% |
| 3 | 311 | 7.7% |
| 8 | 215 | 5.3% |
| 4 | 133 | 3.3% |
| 7 | 76 | 1.9% |
| 5 | 58 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 38 | |
| > | 7 | 13.5% |
| + | 7 | 13.5% |
Space Separator
| Value | Count | Frequency (%) |
| 339595 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2717 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2714 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1952 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2247734 | |
| Common | 728149 | 24.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 267162 | 11.9% |
| n | 167094 | 7.4% |
| a | 147430 | 6.6% |
| r | 142614 | 6.3% |
| o | 125459 | 5.6% |
| t | 117646 | 5.2% |
| s | 116659 | 5.2% |
| l | 83016 | 3.7% |
| i | 72781 | 3.2% |
| d | 62897 | 2.8% |
| Other values (61) | 944976 |
Common
| Value | Count | Frequency (%) |
| . | 347920 | |
| 339595 | ||
| & | 12702 | 1.7% |
| : | 6268 | 0.9% |
| ; | 5183 | 0.7% |
| ( | 2717 | 0.4% |
| ) | 2714 | 0.4% |
| - | 1952 | 0.3% |
| / | 1659 | 0.2% |
| \ | 1596 | 0.2% |
| Other values (20) | 5843 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2968095 | |
| None | 7776 | 0.3% |
| Punctuation | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 347920 | 11.7% |
| 339595 | 11.4% | |
| e | 267162 | 9.0% |
| n | 167094 | 5.6% |
| a | 147430 | 5.0% |
| r | 142614 | 4.8% |
| o | 125459 | 4.2% |
| t | 117646 | 4.0% |
| s | 116659 | 3.9% |
| l | 83016 | 2.8% |
| Other values (71) | 1113500 |
None
| Value | Count | Frequency (%) |
| ü | 5145 | |
| é | 1008 | 13.0% |
| ä | 847 | 10.9% |
| ö | 419 | 5.4% |
| ñ | 143 | 1.8% |
| ø | 118 | 1.5% |
| ë | 34 | 0.4% |
| è | 20 | 0.3% |
| ó | 15 | 0.2% |
| û | 8 | 0.1% |
| Other values (9) | 19 | 0.2% |
Punctuation
| Value | Count | Frequency (%) |
| … | 12 |
individualCount
Text
Missing 
| Distinct | 54 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30538 |
| Missing (%) | 10.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.003725611 |
| Min length | 1 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 228649 | |
| 2 | 11832 | 4.5% |
| 3 | 6214 | 2.4% |
| 4 | 5617 | 2.2% |
| 5 | 3939 | 1.5% |
| 6 | 1721 | 0.7% |
| 7 | 695 | 0.3% |
| 8 | 426 | 0.2% |
| 9 | 305 | 0.1% |
| 10 | 260 | 0.1% |
| Other values (44) | 702 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 229500 | |
| 2 | 12051 | 4.6% |
| 3 | 6372 | 2.4% |
| 4 | 5687 | 2.2% |
| 5 | 4035 | 1.5% |
| 6 | 1786 | 0.7% |
| 7 | 749 | 0.3% |
| 8 | 468 | 0.2% |
| 9 | 372 | 0.1% |
| 0 | 310 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 261330 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 229500 | |
| 2 | 12051 | 4.6% |
| 3 | 6372 | 2.4% |
| 4 | 5687 | 2.2% |
| 5 | 4035 | 1.5% |
| 6 | 1786 | 0.7% |
| 7 | 749 | 0.3% |
| 8 | 468 | 0.2% |
| 9 | 372 | 0.1% |
| 0 | 310 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 261330 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 229500 | |
| 2 | 12051 | 4.6% |
| 3 | 6372 | 2.4% |
| 4 | 5687 | 2.2% |
| 5 | 4035 | 1.5% |
| 6 | 1786 | 0.7% |
| 7 | 749 | 0.3% |
| 8 | 468 | 0.2% |
| 9 | 372 | 0.1% |
| 0 | 310 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 261330 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 229500 | |
| 2 | 12051 | 4.6% |
| 3 | 6372 | 2.4% |
| 4 | 5687 | 2.2% |
| 5 | 4035 | 1.5% |
| 6 | 1786 | 0.7% |
| 7 | 749 | 0.3% |
| 8 | 468 | 0.2% |
| 9 | 372 | 0.1% |
| 0 | 310 | 0.1% |
sex
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 98571 |
| Missing (%) | 33.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.830928575 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FEMALE |
|---|---|
| 2nd row | FEMALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | FEMALE |
| Value | Count | Frequency (%) |
| male | 112422 | |
| female | 79905 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 272232 | |
| M | 192327 | |
| A | 192327 | |
| L | 192327 | |
| F | 79905 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 929118 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 272232 | |
| M | 192327 | |
| A | 192327 | |
| L | 192327 | |
| F | 79905 | 8.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 929118 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 272232 | |
| M | 192327 | |
| A | 192327 | |
| L | 192327 | |
| F | 79905 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 929118 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 272232 | |
| M | 192327 | |
| A | 192327 | |
| L | 192327 | |
| F | 79905 | 8.6% |
lifeStage
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 210308 |
| Missing (%) | 72.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 4.64470778 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Egg |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Immature |
| 5th row | Juvenile |
| Value | Count | Frequency (%) |
| egg | 41586 | |
| adult | 20821 | |
| juvenile | 13228 | 16.4% |
| nestling | 3308 | 4.1% |
| immature | 1546 | 1.9% |
| subadult | 96 | 0.1% |
| embryo | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 86480 | |
| E | 41591 | |
| l | 37453 | |
| u | 35787 | |
| e | 31310 | 8.4% |
| t | 25771 | 6.9% |
| d | 20917 | 5.6% |
| A | 20821 | 5.6% |
| n | 16536 | 4.4% |
| i | 16536 | 4.4% |
| Other values (12) | 41115 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 293727 | |
| Uppercase Letter | 80590 | 21.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| g | 86480 | |
| l | 37453 | |
| u | 35787 | |
| e | 31310 | 10.7% |
| t | 25771 | 8.8% |
| d | 20917 | 7.1% |
| n | 16536 | 5.6% |
| i | 16536 | 5.6% |
| v | 13228 | 4.5% |
| s | 3308 | 1.1% |
| Other values (6) | 6401 | 2.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 41591 | |
| A | 20821 | |
| J | 13228 | 16.4% |
| N | 3308 | 4.1% |
| I | 1546 | 1.9% |
| S | 96 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 374317 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| g | 86480 | |
| E | 41591 | |
| l | 37453 | |
| u | 35787 | |
| e | 31310 | 8.4% |
| t | 25771 | 6.9% |
| d | 20917 | 5.6% |
| A | 20821 | 5.6% |
| n | 16536 | 4.4% |
| i | 16536 | 4.4% |
| Other values (12) | 41115 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 374317 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| g | 86480 | |
| E | 41591 | |
| l | 37453 | |
| u | 35787 | |
| e | 31310 | 8.4% |
| t | 25771 | 6.9% |
| d | 20917 | 5.6% |
| A | 20821 | 5.6% |
| n | 16536 | 4.4% |
| i | 16536 | 4.4% |
| Other values (12) | 41115 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 581796 | |
| P | 290898 | |
| R | 290898 | |
| S | 290898 | |
| N | 290898 | |
| T | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2036286 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 581796 | |
| P | 290898 | |
| R | 290898 | |
| S | 290898 | |
| N | 290898 | |
| T | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2036286 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 581796 | |
| P | 290898 | |
| R | 290898 | |
| S | 290898 | |
| N | 290898 | |
| T | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2036286 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 581796 | |
| P | 290898 | |
| R | 290898 | |
| S | 290898 | |
| N | 290898 | |
| T | 290898 |
preparations
Text
| Distinct | 132 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 37 |
| Mean length | 16.94541729 |
| Min length | 3 |
Unique
| Unique | 45 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | skin (mounted skin) |
|---|---|
| 2nd row | egg (air dried) |
| 3rd row | skin (study skin) |
| 4th row | skin (mounted skin) |
| 5th row | skin (study skin) |
| Value | Count | Frequency (%) |
| skin | 382852 | |
| air | 114350 | 13.3% |
| dried | 114350 | 13.3% |
| study | 108973 | 12.7% |
| mounted | 47886 | 5.6% |
| egg | 41587 | 4.8% |
| skeletonized | 7000 | 0.8% |
| skeleton | 5297 | 0.6% |
| nest | 4725 | 0.5% |
| whole | 4690 | 0.5% |
| Other values (57) | 27515 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 633806 | |
| 568327 | ||
| s | 523666 | |
| n | 456393 | |
| k | 398662 | |
| d | 395200 | |
| ) | 290700 | 5.9% |
| ( | 290700 | 5.9% |
| e | 260863 | 5.3% |
| r | 234948 | 4.8% |
| Other values (34) | 876123 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3771413 | |
| Space Separator | 568327 | 11.5% |
| Close Punctuation | 290700 | 5.9% |
| Open Punctuation | 290700 | 5.9% |
| Uppercase Letter | 6292 | 0.1% |
| Decimal Number | 1128 | < 0.1% |
| Other Punctuation | 601 | < 0.1% |
| Math Symbol | 217 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 633806 | |
| s | 523666 | |
| n | 456393 | |
| k | 398662 | |
| d | 395200 | |
| e | 260863 | |
| r | 234948 | 6.2% |
| t | 180647 | 4.8% |
| u | 161596 | 4.3% |
| a | 122202 | 3.2% |
| Other values (13) | 403430 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 5239 | |
| O | 580 | 9.2% |
| H | 322 | 5.1% |
| B | 88 | 1.4% |
| L | 34 | 0.5% |
| D | 8 | 0.1% |
| N | 8 | 0.1% |
| A | 8 | 0.1% |
| T | 5 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 336 | |
| 6 | 336 | |
| 7 | 228 | |
| 0 | 228 |
Other Punctuation
| Value | Count | Frequency (%) |
| % | 564 | |
| & | 37 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| 568327 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 290700 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 290700 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 217 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3777705 | |
| Common | 1151683 | 23.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 633806 | |
| s | 523666 | |
| n | 456393 | |
| k | 398662 | |
| d | 395200 | |
| e | 260863 | |
| r | 234948 | 6.2% |
| t | 180647 | 4.8% |
| u | 161596 | 4.3% |
| a | 122202 | 3.2% |
| Other values (22) | 409722 |
Common
| Value | Count | Frequency (%) |
| 568327 | ||
| ) | 290700 | |
| ( | 290700 | |
| % | 564 | < 0.1% |
| 9 | 336 | < 0.1% |
| 6 | 336 | < 0.1% |
| 7 | 228 | < 0.1% |
| 0 | 228 | < 0.1% |
| > | 217 | < 0.1% |
| & | 37 | < 0.1% |
| Other values (2) | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4929388 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 633806 | |
| 568327 | ||
| s | 523666 | |
| n | 456393 | |
| k | 398662 | |
| d | 395200 | |
| ) | 290700 | 5.9% |
| ( | 290700 | 5.9% |
| e | 260863 | 5.3% |
| r | 234948 | 4.8% |
| Other values (34) | 876123 |
associatedTaxa
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 290895 |
| Missing (%) | > 99.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 64 |
|---|---|
| Median length | 64 |
| Mean length | 64 |
| Min length | 64 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | has parasite: Cirrophthirius cf. recurvirostrae | Quadraceps sp. |
|---|---|
| 2nd row | has parasite: Cirrophthirius cf. recurvirostrae | Quadraceps sp. |
| 3rd row | has parasite: Cirrophthirius cf. recurvirostrae | Quadraceps sp. |
| Value | Count | Frequency (%) |
| has | 3 | |
| parasite | 3 | |
| cirrophthirius | 3 | |
| cf | 3 | |
| recurvirostrae | 3 | |
| 3 | ||
| quadraceps | 3 | |
| sp | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 27 | |
| 21 | ||
| s | 18 | |
| a | 18 | |
| i | 15 | 7.8% |
| p | 12 | 6.2% |
| e | 12 | 6.2% |
| h | 9 | 4.7% |
| t | 9 | 4.7% |
| u | 9 | 4.7% |
| Other values (10) | 42 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 153 | |
| Space Separator | 21 | 10.9% |
| Other Punctuation | 9 | 4.7% |
| Uppercase Letter | 6 | 3.1% |
| Math Symbol | 3 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 27 | |
| s | 18 | |
| a | 18 | |
| i | 15 | |
| p | 12 | |
| e | 12 | |
| h | 9 | 5.9% |
| t | 9 | 5.9% |
| u | 9 | 5.9% |
| c | 9 | 5.9% |
| Other values (4) | 15 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 | |
| : | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Q | 3 | |
| C | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 21 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 159 | |
| Common | 33 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 27 | |
| s | 18 | |
| a | 18 | |
| i | 15 | |
| p | 12 | |
| e | 12 | |
| h | 9 | 5.7% |
| t | 9 | 5.7% |
| u | 9 | 5.7% |
| c | 9 | 5.7% |
| Other values (6) | 21 |
Common
| Value | Count | Frequency (%) |
| 21 | ||
| . | 6 | 18.2% |
| | | 3 | 9.1% |
| : | 3 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 27 | |
| 21 | ||
| s | 18 | |
| a | 18 | |
| i | 15 | 7.8% |
| p | 12 | 6.2% |
| e | 12 | 6.2% |
| h | 9 | 4.7% |
| t | 9 | 4.7% |
| u | 9 | 4.7% |
| Other values (10) | 42 |
eventDate
Text
Missing 
| Distinct | 44850 |
|---|---|
| Distinct (%) | 20.7% |
| Missing | 74430 |
| Missing (%) | 25.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 11.38132657 |
| Min length | 10 |
Unique
| Unique | 12115 ? |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | 1904-07-15 |
|---|---|
| 2nd row | 1887-11-19 |
| 3rd row | 2014-01-05 |
| 4th row | 2008-09-09 |
| 5th row | 2006-04-22 |
| Value | Count | Frequency (%) |
| 1875-10-01/1875-10-31 | 571 | 0.3% |
| 1901-01-01/1901-12-31 | 442 | 0.2% |
| 1930-01-01/1951-12-31 | 384 | 0.2% |
| 1912-01-01/1916-12-31 | 312 | 0.1% |
| 1820-12-01/1821-09-30 | 311 | 0.1% |
| 1862-01-01/1862-12-31 | 295 | 0.1% |
| 1903-01-01/1908-12-31 | 285 | 0.1% |
| 1868-01-01/1868-12-31 | 283 | 0.1% |
| 1982-01-01/1982-12-31 | 260 | 0.1% |
| 1861-01-01/1861-12-31 | 242 | 0.1% |
| Other values (44840) | 213083 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 535271 | |
| - | 487302 | |
| 0 | 371646 | |
| 9 | 254117 | |
| 2 | 178899 | 7.3% |
| 8 | 135169 | 5.5% |
| 3 | 113107 | 4.6% |
| 6 | 104773 | 4.3% |
| 5 | 96538 | 3.9% |
| 7 | 81317 | 3.3% |
| Other values (2) | 105554 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1949208 | |
| Dash Punctuation | 487302 | 19.8% |
| Other Punctuation | 27183 | 1.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 535271 | |
| 0 | 371646 | |
| 9 | 254117 | |
| 2 | 178899 | 9.2% |
| 8 | 135169 | 6.9% |
| 3 | 113107 | 5.8% |
| 6 | 104773 | 5.4% |
| 5 | 96538 | 5.0% |
| 7 | 81317 | 4.2% |
| 4 | 78371 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 487302 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 27183 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2463693 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 535271 | |
| - | 487302 | |
| 0 | 371646 | |
| 9 | 254117 | |
| 2 | 178899 | 7.3% |
| 8 | 135169 | 5.5% |
| 3 | 113107 | 4.6% |
| 6 | 104773 | 4.3% |
| 5 | 96538 | 3.9% |
| 7 | 81317 | 3.3% |
| Other values (2) | 105554 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2463693 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 535271 | |
| - | 487302 | |
| 0 | 371646 | |
| 9 | 254117 | |
| 2 | 178899 | 7.3% |
| 8 | 135169 | 5.5% |
| 3 | 113107 | 4.6% |
| 6 | 104773 | 4.3% |
| 5 | 96538 | 3.9% |
| 7 | 81317 | 3.3% |
| Other values (2) | 105554 | 4.3% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 74430 |
| Missing (%) | 25.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.633331485 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 197 |
|---|---|
| 2nd row | 323 |
| 3rd row | 5 |
| 4th row | 253 |
| 5th row | 112 |
| Value | Count | Frequency (%) |
| 1 | 13089 | 6.0% |
| 121 | 2472 | 1.1% |
| 274 | 1859 | 0.9% |
| 91 | 1681 | 0.8% |
| 152 | 1646 | 0.8% |
| 60 | 1523 | 0.7% |
| 32 | 1509 | 0.7% |
| 122 | 1452 | 0.7% |
| 153 | 1253 | 0.6% |
| 305 | 1230 | 0.6% |
| Other values (356) | 188754 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 130062 | |
| 2 | 94786 | |
| 3 | 75811 | |
| 4 | 44773 | 7.9% |
| 5 | 42295 | 7.4% |
| 6 | 39955 | 7.0% |
| 0 | 36420 | 6.4% |
| 7 | 36016 | 6.3% |
| 8 | 35025 | 6.1% |
| 9 | 34889 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 570032 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 130062 | |
| 2 | 94786 | |
| 3 | 75811 | |
| 4 | 44773 | 7.9% |
| 5 | 42295 | 7.4% |
| 6 | 39955 | 7.0% |
| 0 | 36420 | 6.4% |
| 7 | 36016 | 6.3% |
| 8 | 35025 | 6.1% |
| 9 | 34889 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 570032 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 130062 | |
| 2 | 94786 | |
| 3 | 75811 | |
| 4 | 44773 | 7.9% |
| 5 | 42295 | 7.4% |
| 6 | 39955 | 7.0% |
| 0 | 36420 | 6.4% |
| 7 | 36016 | 6.3% |
| 8 | 35025 | 6.1% |
| 9 | 34889 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 570032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 130062 | |
| 2 | 94786 | |
| 3 | 75811 | |
| 4 | 44773 | 7.9% |
| 5 | 42295 | 7.4% |
| 6 | 39955 | 7.0% |
| 0 | 36420 | 6.4% |
| 7 | 36016 | 6.3% |
| 8 | 35025 | 6.1% |
| 9 | 34889 | 6.1% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 74430 |
| Missing (%) | 25.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.752028013 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 197 |
|---|---|
| 2nd row | 323 |
| 3rd row | 5 |
| 4th row | 253 |
| 5th row | 112 |
| Value | Count | Frequency (%) |
| 365 | 9577 | 4.4% |
| 366 | 3449 | 1.6% |
| 120 | 1930 | 0.9% |
| 304 | 1835 | 0.8% |
| 151 | 1825 | 0.8% |
| 273 | 1697 | 0.8% |
| 90 | 1423 | 0.7% |
| 121 | 1386 | 0.6% |
| 181 | 1326 | 0.6% |
| 59 | 1319 | 0.6% |
| Other values (356) | 190701 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 118478 | |
| 2 | 92661 | |
| 3 | 89296 | |
| 6 | 54005 | |
| 5 | 51255 | |
| 4 | 45056 | 7.6% |
| 0 | 38299 | 6.4% |
| 7 | 35879 | 6.0% |
| 9 | 35493 | 6.0% |
| 8 | 35304 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 595726 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 118478 | |
| 2 | 92661 | |
| 3 | 89296 | |
| 6 | 54005 | |
| 5 | 51255 | |
| 4 | 45056 | 7.6% |
| 0 | 38299 | 6.4% |
| 7 | 35879 | 6.0% |
| 9 | 35493 | 6.0% |
| 8 | 35304 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 595726 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 118478 | |
| 2 | 92661 | |
| 3 | 89296 | |
| 6 | 54005 | |
| 5 | 51255 | |
| 4 | 45056 | 7.6% |
| 0 | 38299 | 6.4% |
| 7 | 35879 | 6.0% |
| 9 | 35493 | 6.0% |
| 8 | 35304 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 595726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 118478 | |
| 2 | 92661 | |
| 3 | 89296 | |
| 6 | 54005 | |
| 5 | 51255 | |
| 4 | 45056 | 7.6% |
| 0 | 38299 | 6.4% |
| 7 | 35879 | 6.0% |
| 9 | 35493 | 6.0% |
| 8 | 35304 | 5.9% |
year
Text
Missing 
| Distinct | 227 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 78830 |
| Missing (%) | 27.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1904 |
|---|---|
| 2nd row | 1887 |
| 3rd row | 2014 |
| 4th row | 2008 |
| 5th row | 2006 |
| Value | Count | Frequency (%) |
| 1909 | 4335 | 2.0% |
| 1910 | 4083 | 1.9% |
| 1913 | 3479 | 1.6% |
| 1912 | 3322 | 1.6% |
| 1920 | 3154 | 1.5% |
| 1908 | 3022 | 1.4% |
| 1907 | 2956 | 1.4% |
| 1911 | 2923 | 1.4% |
| 1968 | 2883 | 1.4% |
| 1919 | 2827 | 1.3% |
| Other values (217) | 179084 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 260246 | |
| 9 | 199374 | |
| 8 | 79743 | 9.4% |
| 6 | 55292 | 6.5% |
| 0 | 54406 | 6.4% |
| 2 | 47998 | 5.7% |
| 7 | 41819 | 4.9% |
| 5 | 39221 | 4.6% |
| 3 | 37532 | 4.4% |
| 4 | 32641 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 848272 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 260246 | |
| 9 | 199374 | |
| 8 | 79743 | 9.4% |
| 6 | 55292 | 6.5% |
| 0 | 54406 | 6.4% |
| 2 | 47998 | 5.7% |
| 7 | 41819 | 4.9% |
| 5 | 39221 | 4.6% |
| 3 | 37532 | 4.4% |
| 4 | 32641 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 848272 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 260246 | |
| 9 | 199374 | |
| 8 | 79743 | 9.4% |
| 6 | 55292 | 6.5% |
| 0 | 54406 | 6.4% |
| 2 | 47998 | 5.7% |
| 7 | 41819 | 4.9% |
| 5 | 39221 | 4.6% |
| 3 | 37532 | 4.4% |
| 4 | 32641 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 848272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 260246 | |
| 9 | 199374 | |
| 8 | 79743 | 9.4% |
| 6 | 55292 | 6.5% |
| 0 | 54406 | 6.4% |
| 2 | 47998 | 5.7% |
| 7 | 41819 | 4.9% |
| 5 | 39221 | 4.6% |
| 3 | 37532 | 4.4% |
| 4 | 32641 | 3.8% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 87276 |
| Missing (%) | 30.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.22423412 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 7 |
|---|---|
| 2nd row | 11 |
| 3rd row | 1 |
| 4th row | 9 |
| 5th row | 4 |
| Value | Count | Frequency (%) |
| 5 | 29179 | |
| 4 | 21061 | |
| 6 | 20812 | |
| 10 | 17835 | |
| 3 | 16215 | |
| 11 | 14949 | |
| 9 | 14742 | |
| 1 | 14411 | |
| 2 | 14384 | |
| 7 | 13920 | |
| Other values (2) | 26114 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 75019 | |
| 5 | 29179 | 11.7% |
| 2 | 27259 | 10.9% |
| 4 | 21061 | 8.4% |
| 6 | 20812 | 8.3% |
| 0 | 17835 | 7.2% |
| 3 | 16215 | 6.5% |
| 9 | 14742 | 5.9% |
| 7 | 13920 | 5.6% |
| 8 | 13239 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 249281 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 75019 | |
| 5 | 29179 | 11.7% |
| 2 | 27259 | 10.9% |
| 4 | 21061 | 8.4% |
| 6 | 20812 | 8.3% |
| 0 | 17835 | 7.2% |
| 3 | 16215 | 6.5% |
| 9 | 14742 | 5.9% |
| 7 | 13920 | 5.6% |
| 8 | 13239 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 249281 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 75019 | |
| 5 | 29179 | 11.7% |
| 2 | 27259 | 10.9% |
| 4 | 21061 | 8.4% |
| 6 | 20812 | 8.3% |
| 0 | 17835 | 7.2% |
| 3 | 16215 | 6.5% |
| 9 | 14742 | 5.9% |
| 7 | 13920 | 5.6% |
| 8 | 13239 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 249281 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 75019 | |
| 5 | 29179 | 11.7% |
| 2 | 27259 | 10.9% |
| 4 | 21061 | 8.4% |
| 6 | 20812 | 8.3% |
| 0 | 17835 | 7.2% |
| 3 | 16215 | 6.5% |
| 9 | 14742 | 5.9% |
| 7 | 13920 | 5.6% |
| 8 | 13239 | 5.3% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 101613 |
| Missing (%) | 34.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.704292469 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 15 |
|---|---|
| 2nd row | 19 |
| 3rd row | 5 |
| 4th row | 9 |
| 5th row | 22 |
| Value | Count | Frequency (%) |
| 15 | 7094 | 3.7% |
| 1 | 7019 | 3.7% |
| 10 | 6933 | 3.7% |
| 20 | 6929 | 3.7% |
| 18 | 6540 | 3.5% |
| 5 | 6461 | 3.4% |
| 12 | 6454 | 3.4% |
| 16 | 6383 | 3.4% |
| 25 | 6370 | 3.4% |
| 2 | 6356 | 3.4% |
| Other values (21) | 122746 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 84961 | |
| 2 | 80139 | |
| 3 | 26438 | 8.2% |
| 5 | 19925 | 6.2% |
| 0 | 19527 | 6.1% |
| 6 | 18700 | 5.8% |
| 8 | 18677 | 5.8% |
| 7 | 18412 | 5.7% |
| 4 | 18316 | 5.7% |
| 9 | 17502 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 322597 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 84961 | |
| 2 | 80139 | |
| 3 | 26438 | 8.2% |
| 5 | 19925 | 6.2% |
| 0 | 19527 | 6.1% |
| 6 | 18700 | 5.8% |
| 8 | 18677 | 5.8% |
| 7 | 18412 | 5.7% |
| 4 | 18316 | 5.7% |
| 9 | 17502 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 322597 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 84961 | |
| 2 | 80139 | |
| 3 | 26438 | 8.2% |
| 5 | 19925 | 6.2% |
| 0 | 19527 | 6.1% |
| 6 | 18700 | 5.8% |
| 8 | 18677 | 5.8% |
| 7 | 18412 | 5.7% |
| 4 | 18316 | 5.7% |
| 9 | 17502 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 322597 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 84961 | |
| 2 | 80139 | |
| 3 | 26438 | 8.2% |
| 5 | 19925 | 6.2% |
| 0 | 19527 | 6.1% |
| 6 | 18700 | 5.8% |
| 8 | 18677 | 5.8% |
| 7 | 18412 | 5.7% |
| 4 | 18316 | 5.7% |
| 9 | 17502 | 5.4% |
Missing 
| Distinct | 75505 |
|---|---|
| Distinct (%) | 32.7% |
| Missing | 59902 |
| Missing (%) | 20.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 255 |
|---|---|
| Median length | 10 |
| Mean length | 10.37595889 |
| Min length | 1 |
Unique
| Unique | 36235 ? |
|---|---|
| Unique (%) | 15.7% |
Sample
| 1st row | 15/7/1904 |
|---|---|
| 2nd row | 19-11-1887 |
| 3rd row | before 1880 |
| 4th row | 5 januari 2014 |
| 5th row | 9 september 2008 |
| Value | Count | Frequency (%) |
| 5954 | 2.0% | |
| on | 4818 | 1.6% |
| label | 4338 | 1.5% |
| may | 2008 | 0.7% |
| april | 1649 | 0.6% |
| september | 1517 | 0.5% |
| october | 1257 | 0.4% |
| june | 1251 | 0.4% |
| december | 1227 | 0.4% |
| november | 1156 | 0.4% |
| Other values (69619) | 268785 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 471619 | |
| - | 339877 | |
| 9 | 255264 | |
| 0 | 218095 | |
| 2 | 170497 | 7.1% |
| 8 | 129945 | 5.4% |
| 6 | 103678 | 4.3% |
| 5 | 96019 | 4.0% |
| 3 | 94216 | 3.9% |
| / | 82976 | 3.5% |
| Other values (90) | 434619 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1700058 | |
| Dash Punctuation | 339878 | 14.2% |
| Lowercase Letter | 165606 | 6.9% |
| Other Punctuation | 99666 | 4.2% |
| Space Separator | 64354 | 2.7% |
| Uppercase Letter | 26074 | 1.1% |
| Math Symbol | 629 | < 0.1% |
| Open Punctuation | 269 | < 0.1% |
| Close Punctuation | 267 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26055 | |
| a | 16564 | |
| r | 15695 | |
| l | 15620 | |
| b | 12889 | 7.8% |
| n | 11626 | 7.0% |
| u | 9103 | 5.5% |
| o | 8700 | 5.3% |
| t | 7075 | 4.3% |
| i | 6959 | 4.2% |
| Other values (26) | 35320 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4293 | |
| O | 4247 | |
| J | 3844 | |
| A | 2880 | |
| N | 1997 | |
| D | 1853 | |
| S | 1736 | |
| I | 1097 | 4.2% |
| F | 1089 | 4.2% |
| H | 788 | 3.0% |
| Other values (16) | 2250 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 82976 | |
| , | 5609 | 5.6% |
| : | 5189 | 5.2% |
| . | 3974 | 4.0% |
| ' | 733 | 0.7% |
| \ | 686 | 0.7% |
| ? | 373 | 0.4% |
| " | 49 | < 0.1% |
| ; | 34 | < 0.1% |
| ! | 24 | < 0.1% |
| Other values (3) | 19 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 471619 | |
| 9 | 255264 | |
| 0 | 218095 | |
| 2 | 170497 | 10.0% |
| 8 | 129945 | 7.6% |
| 6 | 103678 | 6.1% |
| 5 | 96019 | 5.6% |
| 3 | 94216 | 5.5% |
| 7 | 81858 | 4.8% |
| 4 | 78867 | 4.6% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 585 | |
| > | 16 | 2.5% |
| < | 14 | 2.2% |
| + | 10 | 1.6% |
| = | 4 | 0.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 339877 | |
| – | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 185 | |
| [ | 84 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 184 | |
| ] | 83 |
Space Separator
| Value | Count | Frequency (%) |
| 64354 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2205125 | |
| Latin | 191680 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 26055 | |
| a | 16564 | 8.6% |
| r | 15695 | 8.2% |
| l | 15620 | 8.1% |
| b | 12889 | 6.7% |
| n | 11626 | 6.1% |
| u | 9103 | 4.7% |
| o | 8700 | 4.5% |
| t | 7075 | 3.7% |
| i | 6959 | 3.6% |
| Other values (52) | 61394 |
Common
| Value | Count | Frequency (%) |
| 1 | 471619 | |
| - | 339877 | |
| 9 | 255264 | |
| 0 | 218095 | |
| 2 | 170497 | 7.7% |
| 8 | 129945 | 5.9% |
| 6 | 103678 | 4.7% |
| 5 | 96019 | 4.4% |
| 3 | 94216 | 4.3% |
| / | 82976 | 3.8% |
| Other values (28) | 242939 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2396058 | |
| None | 739 | < 0.1% |
| Punctuation | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 471619 | |
| - | 339877 | |
| 9 | 255264 | |
| 0 | 218095 | |
| 2 | 170497 | 7.1% |
| 8 | 129945 | 5.4% |
| 6 | 103678 | 4.3% |
| 5 | 96019 | 4.0% |
| 3 | 94216 | 3.9% |
| / | 82976 | 3.5% |
| Other values (75) | 433872 |
None
| Value | Count | Frequency (%) |
| ± | 585 | |
| ü | 63 | 8.5% |
| é | 35 | 4.7% |
| ä | 28 | 3.8% |
| â | 16 | 2.2% |
| ó | 4 | 0.5% |
| ´ | 2 | 0.3% |
| ï | 1 | 0.1% |
| ò | 1 | 0.1% |
| ½ | 1 | 0.1% |
| Other values (3) | 3 | 0.4% |
Punctuation
| Value | Count | Frequency (%) |
| … | 7 | |
| – | 1 | 12.5% |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 94391 |
| Missing (%) | 32.4% |
| Memory size | 2.2 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.659584646 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | OCEANIA |
| 3rd row | OCEANIA |
| 4th row | OCEANIA |
| 5th row | AFRICA |
| Value | Count | Frequency (%) |
| europe | 81421 | |
| asia | 55363 | |
| south_america | 25346 | 12.9% |
| africa | 17201 | 8.8% |
| oceania | 9475 | 4.8% |
| north_america | 7546 | 3.8% |
| antarctica | 155 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 230327 | |
| E | 205209 | |
| R | 139215 | |
| O | 123788 | |
| I | 115086 | |
| U | 106767 | |
| P | 81421 | 6.2% |
| S | 80709 | 6.2% |
| C | 59878 | 4.6% |
| T | 33202 | 2.5% |
| Other values (5) | 133053 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1275763 | |
| Connector Punctuation | 32892 | 2.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 230327 | |
| E | 205209 | |
| R | 139215 | |
| O | 123788 | |
| I | 115086 | |
| U | 106767 | |
| P | 81421 | 6.4% |
| S | 80709 | 6.3% |
| C | 59878 | 4.7% |
| T | 33202 | 2.6% |
| Other values (4) | 100161 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 32892 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1275763 | |
| Common | 32892 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 230327 | |
| E | 205209 | |
| R | 139215 | |
| O | 123788 | |
| I | 115086 | |
| U | 106767 | |
| P | 81421 | 6.4% |
| S | 80709 | 6.3% |
| C | 59878 | 4.7% |
| T | 33202 | 2.6% |
| Other values (4) | 100161 |
Common
| Value | Count | Frequency (%) |
| _ | 32892 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1308655 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 230327 | |
| E | 205209 | |
| R | 139215 | |
| O | 123788 | |
| I | 115086 | |
| U | 106767 | |
| P | 81421 | 6.2% |
| S | 80709 | 6.2% |
| C | 59878 | 4.6% |
| T | 33202 | 2.5% |
| Other values (5) | 133053 |
island
Text
Missing 
| Distinct | 1622 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 200600 |
| Missing (%) | 69.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 47 |
| Mean length | 6.738764978 |
| Min length | 3 |
Unique
| Unique | 702 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | South Island |
|---|---|
| 2nd row | Vlieland |
| 3rd row | Moluccas |
| 4th row | Moluccas |
| 5th row | Moluccas |
| Value | Count | Frequency (%) |
| java | 34511 | |
| sumatra | 10786 | 10.1% |
| celebes | 5435 | 5.1% |
| guinea | 4561 | 4.3% |
| new | 3784 | 3.5% |
| borneo | 3686 | 3.4% |
| islands | 3176 | 3.0% |
| texel | 2904 | 2.7% |
| sunda | 2297 | 2.1% |
| lesser | 2296 | 2.1% |
| Other values (1286) | 33756 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 134860 | |
| e | 54105 | 8.9% |
| v | 35047 | 5.8% |
| J | 34784 | 5.7% |
| r | 30733 | 5.1% |
| n | 29173 | 4.8% |
| u | 26669 | 4.4% |
| s | 25626 | 4.2% |
| l | 23733 | 3.9% |
| o | 22092 | 3.6% |
| Other values (75) | 191675 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 481547 | |
| Uppercase Letter | 107161 | 17.6% |
| Space Separator | 16894 | 2.8% |
| Other Punctuation | 1792 | 0.3% |
| Open Punctuation | 391 | 0.1% |
| Close Punctuation | 391 | 0.1% |
| Dash Punctuation | 318 | 0.1% |
| Decimal Number | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 134860 | |
| e | 54105 | |
| v | 35047 | 7.3% |
| r | 30733 | 6.4% |
| n | 29173 | 6.1% |
| u | 26669 | 5.5% |
| s | 25626 | 5.3% |
| l | 23733 | 4.9% |
| o | 22092 | 4.6% |
| i | 18605 | 3.9% |
| Other values (34) | 80904 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 34784 | |
| S | 17177 | |
| C | 8035 | 7.5% |
| B | 6776 | 6.3% |
| T | 5863 | 5.5% |
| G | 5613 | 5.2% |
| I | 5488 | 5.1% |
| N | 5301 | 4.9% |
| M | 4501 | 4.2% |
| L | 3719 | 3.5% |
| Other values (17) | 9904 | 9.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1136 | |
| , | 606 | |
| ? | 28 | 1.6% |
| ' | 19 | 1.1% |
| / | 3 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 349 | |
| ( | 42 | 10.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 349 | |
| ) | 42 | 10.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 16894 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 318 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 588708 | |
| Common | 19789 | 3.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 134860 | |
| e | 54105 | 9.2% |
| v | 35047 | 6.0% |
| J | 34784 | 5.9% |
| r | 30733 | 5.2% |
| n | 29173 | 5.0% |
| u | 26669 | 4.5% |
| s | 25626 | 4.4% |
| l | 23733 | 4.0% |
| o | 22092 | 3.8% |
| Other values (61) | 171886 |
Common
| Value | Count | Frequency (%) |
| 16894 | ||
| . | 1136 | 5.7% |
| , | 606 | 3.1% |
| [ | 349 | 1.8% |
| ] | 349 | 1.8% |
| - | 318 | 1.6% |
| ( | 42 | 0.2% |
| ) | 42 | 0.2% |
| ? | 28 | 0.1% |
| ' | 19 | 0.1% |
| Other values (4) | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 606505 | |
| None | 1992 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 134860 | |
| e | 54105 | 8.9% |
| v | 35047 | 5.8% |
| J | 34784 | 5.7% |
| r | 30733 | 5.1% |
| n | 29173 | 4.8% |
| u | 26669 | 4.4% |
| s | 25626 | 4.2% |
| l | 23733 | 3.9% |
| o | 22092 | 3.6% |
| Other values (55) | 189683 |
None
| Value | Count | Frequency (%) |
| ç | 1160 | |
| ë | 262 | 13.2% |
| é | 198 | 9.9% |
| ø | 169 | 8.5% |
| ö | 100 | 5.0% |
| Ö | 40 | 2.0% |
| á | 11 | 0.6% |
| ü | 11 | 0.6% |
| ã | 9 | 0.5% |
| í | 9 | 0.5% |
| Other values (10) | 23 | 1.2% |
countryCode
Text
Missing 
| Distinct | 219 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 47203 |
| Missing (%) | 16.2% |
| Memory size | 2.2 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NL |
|---|---|
| 2nd row | AU |
| 3rd row | AU |
| 4th row | AU |
| 5th row | SN |
| Value | Count | Frequency (%) |
| id | 77470 | |
| nl | 69474 | |
| sr | 13923 | 5.7% |
| ke | 3747 | 1.5% |
| br | 3554 | 1.5% |
| us | 3540 | 1.5% |
| zz | 3536 | 1.5% |
| au | 3349 | 1.4% |
| co | 3153 | 1.3% |
| tw | 2777 | 1.1% |
| Other values (209) | 59172 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 82774 | |
| D | 81393 | |
| N | 76476 | |
| L | 74514 | |
| R | 24587 | 5.0% |
| S | 23098 | 4.7% |
| Z | 14189 | 2.9% |
| E | 13028 | 2.7% |
| T | 11985 | 2.5% |
| C | 10224 | 2.1% |
| Other values (16) | 75122 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 487390 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 82774 | |
| D | 81393 | |
| N | 76476 | |
| L | 74514 | |
| R | 24587 | 5.0% |
| S | 23098 | 4.7% |
| Z | 14189 | 2.9% |
| E | 13028 | 2.7% |
| T | 11985 | 2.5% |
| C | 10224 | 2.1% |
| Other values (16) | 75122 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 487390 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 82774 | |
| D | 81393 | |
| N | 76476 | |
| L | 74514 | |
| R | 24587 | 5.0% |
| S | 23098 | 4.7% |
| Z | 14189 | 2.9% |
| E | 13028 | 2.7% |
| T | 11985 | 2.5% |
| C | 10224 | 2.1% |
| Other values (16) | 75122 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 487390 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 82774 | |
| D | 81393 | |
| N | 76476 | |
| L | 74514 | |
| R | 24587 | 5.0% |
| S | 23098 | 4.7% |
| Z | 14189 | 2.9% |
| E | 13028 | 2.7% |
| T | 11985 | 2.5% |
| C | 10224 | 2.1% |
| Other values (16) | 75122 |
stateProvince
Text
Missing 
| Distinct | 7178 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 137182 |
| Missing (%) | 47.2% |
| Memory size | 2.2 MiB |
Length
| Max length | 80 |
|---|---|
| Median length | 71 |
| Mean length | 11.67551849 |
| Min length | 1 |
Unique
| Unique | 3142 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | South Holland |
|---|---|
| 2nd row | New South Wales |
| 3rd row | South Australia |
| 4th row | Queensland |
| 5th row | Friesland |
| Value | Count | Frequency (%) |
| holland | 26804 | 10.7% |
| north | 19049 | 7.6% |
| south | 12974 | 5.2% |
| preanger | 9164 | 3.7% |
| java | 8838 | 3.5% |
| gelderland | 6562 | 2.6% |
| friesland | 4328 | 1.7% |
| guinea | 4302 | 1.7% |
| overijssel | 3400 | 1.4% |
| utrecht | 3321 | 1.3% |
| Other values (5330) | 151490 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 201596 | 11.2% |
| e | 142421 | 7.9% |
| n | 125543 | 7.0% |
| r | 122104 | 6.8% |
| l | 121317 | 6.8% |
| o | 110985 | 6.2% |
| 96516 | 5.4% | |
| t | 83085 | 4.6% |
| i | 75782 | 4.2% |
| d | 74750 | 4.2% |
| Other values (106) | 640615 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1393309 | |
| Uppercase Letter | 254898 | 14.2% |
| Space Separator | 96516 | 5.4% |
| Other Punctuation | 36550 | 2.0% |
| Dash Punctuation | 10986 | 0.6% |
| Close Punctuation | 1011 | 0.1% |
| Open Punctuation | 1010 | 0.1% |
| Decimal Number | 229 | < 0.1% |
| Math Symbol | 201 | < 0.1% |
| Other Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 201596 | |
| e | 142421 | |
| n | 125543 | |
| r | 122104 | |
| l | 121317 | |
| o | 110985 | |
| t | 83085 | 6.0% |
| i | 75782 | 5.4% |
| d | 74750 | 5.4% |
| s | 60558 | 4.3% |
| Other values (42) | 275168 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 33640 | |
| H | 31015 | |
| S | 26906 | 10.6% |
| P | 18598 | 7.3% |
| G | 16799 | 6.6% |
| B | 13084 | 5.1% |
| C | 11351 | 4.5% |
| W | 10922 | 4.3% |
| J | 10679 | 4.2% |
| M | 10545 | 4.1% |
| Other values (21) | 71359 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 92 | |
| 1 | 62 | |
| 5 | 13 | 5.7% |
| 6 | 13 | 5.7% |
| 4 | 13 | 5.7% |
| 2 | 11 | 4.8% |
| 9 | 10 | 4.4% |
| 3 | 7 | 3.1% |
| 7 | 4 | 1.7% |
| 8 | 4 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 19171 | |
| . | 16675 | |
| / | 201 | 0.5% |
| & | 170 | 0.5% |
| ' | 157 | 0.4% |
| : | 113 | 0.3% |
| ? | 52 | 0.1% |
| " | 8 | < 0.1% |
| ; | 3 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 83 | |
| < | 83 | |
| ± | 27 | 13.4% |
| = | 8 | 4.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 779 | |
| ) | 231 | 22.8% |
| } | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 777 | |
| ( | 232 | 23.0% |
| { | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 96516 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10986 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1648207 | |
| Common | 146507 | 8.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 201596 | 12.2% |
| e | 142421 | 8.6% |
| n | 125543 | 7.6% |
| r | 122104 | 7.4% |
| l | 121317 | 7.4% |
| o | 110985 | 6.7% |
| t | 83085 | 5.0% |
| i | 75782 | 4.6% |
| d | 74750 | 4.5% |
| s | 60558 | 3.7% |
| Other values (73) | 530066 |
Common
| Value | Count | Frequency (%) |
| 96516 | ||
| , | 19171 | 13.1% |
| . | 16675 | 11.4% |
| - | 10986 | 7.5% |
| ] | 779 | 0.5% |
| [ | 777 | 0.5% |
| ( | 232 | 0.2% |
| ) | 231 | 0.2% |
| / | 201 | 0.1% |
| & | 170 | 0.1% |
| Other values (23) | 769 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1788335 | |
| None | 6379 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 201596 | 11.3% |
| e | 142421 | 8.0% |
| n | 125543 | 7.0% |
| r | 122104 | 6.8% |
| l | 121317 | 6.8% |
| o | 110985 | 6.2% |
| 96516 | 5.4% | |
| t | 83085 | 4.6% |
| i | 75782 | 4.2% |
| d | 74750 | 4.2% |
| Other values (73) | 634236 |
None
| Value | Count | Frequency (%) |
| â | 2265 | |
| ë | 2122 | |
| ä | 509 | 8.0% |
| é | 410 | 6.4% |
| ü | 208 | 3.3% |
| ô | 192 | 3.0% |
| ö | 128 | 2.0% |
| è | 126 | 2.0% |
| á | 90 | 1.4% |
| å | 55 | 0.9% |
| Other values (23) | 274 | 4.3% |
locality
Text
Missing 
| Distinct | 29704 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 79647 |
| Missing (%) | 27.4% |
| Memory size | 2.2 MiB |
Length
| Max length | 35460 |
|---|---|
| Median length | 93 |
| Mean length | 16.32246001 |
| Min length | 2 |
Unique
| Unique | 16268 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | Lisse |
|---|---|
| 2nd row | New South Wales, no further locality |
| 3rd row | Kangaroo I. |
| 4th row | sine loco [SW & SE Australia] |
| 5th row | Senegal, no further locality |
| Value | Count | Frequency (%) |
| locality | 9277 | 1.9% |
| no | 9263 | 1.9% |
| further | 9250 | 1.9% |
| i | 8571 | 1.8% |
| java | 8148 | 1.7% |
| sine | 6339 | 1.3% |
| loco | 6337 | 1.3% |
| west | 5995 | 1.2% |
| area | 5203 | 1.1% |
| pangerango | 4791 | 1.0% |
| Other values (25114) | 414054 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 341144 | 9.9% |
| e | 314678 | 9.1% |
| 273926 | 7.9% | |
| n | 233745 | 6.8% |
| r | 210253 | 6.1% |
| o | 208503 | 6.0% |
| i | 173894 | 5.0% |
| t | 132266 | 3.8% |
| l | 130259 | 3.8% |
| s | 107689 | 3.1% |
| Other values (125) | 1321779 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2573475 | |
| Uppercase Letter | 400106 | 11.6% |
| Space Separator | 273926 | 7.9% |
| Other Punctuation | 115906 | 3.4% |
| Decimal Number | 23484 | 0.7% |
| Close Punctuation | 18991 | 0.6% |
| Open Punctuation | 18990 | 0.6% |
| Dash Punctuation | 11570 | 0.3% |
| Control | 7136 | 0.2% |
| Math Symbol | 3131 | 0.1% |
| Other values (7) | 1421 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 341144 | |
| e | 314678 | |
| n | 233745 | 9.1% |
| r | 210253 | 8.2% |
| o | 208503 | 8.1% |
| i | 173894 | 6.8% |
| t | 132266 | 5.1% |
| l | 130259 | 5.1% |
| s | 107689 | 4.2% |
| u | 106314 | 4.1% |
| Other values (44) | 614730 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 39993 | 10.0% |
| B | 32641 | 8.2% |
| M | 28139 | 7.0% |
| P | 27155 | 6.8% |
| W | 26168 | 6.5% |
| N | 20653 | 5.2% |
| K | 19987 | 5.0% |
| T | 18484 | 4.6% |
| H | 18126 | 4.5% |
| A | 17584 | 4.4% |
| Other values (25) | 151176 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 71200 | |
| . | 22457 | 19.4% |
| ' | 8903 | 7.7% |
| / | 6607 | 5.7% |
| ? | 2997 | 2.6% |
| " | 2017 | 1.7% |
| & | 989 | 0.9% |
| : | 541 | 0.5% |
| ; | 111 | 0.1% |
| ! | 70 | 0.1% |
| Other values (2) | 14 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6579 | |
| 1 | 3671 | |
| 2 | 2875 | |
| 5 | 2515 | 10.7% |
| 3 | 1827 | 7.8% |
| 4 | 1603 | 6.8% |
| 6 | 1170 | 5.0% |
| 8 | 1170 | 5.0% |
| 9 | 1115 | 4.7% |
| 7 | 959 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1027 | |
| > | 1022 | |
| < | 995 | |
| ± | 50 | 1.6% |
| | | 34 | 1.1% |
| + | 2 | 0.1% |
| ~ | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 12701 | |
| ( | 6288 | |
| { | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 12699 | |
| ) | 6285 | |
| } | 7 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 7104 | ||
| 32 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 273926 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11570 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 615 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 376 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 312 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 64 |
Other Letter
| Value | Count | Frequency (%) |
| º | 38 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 12 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2973619 | |
| Common | 474517 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 341144 | 11.5% |
| e | 314678 | 10.6% |
| n | 233745 | 7.9% |
| r | 210253 | 7.1% |
| o | 208503 | 7.0% |
| i | 173894 | 5.8% |
| t | 132266 | 4.4% |
| l | 130259 | 4.4% |
| s | 107689 | 3.6% |
| u | 106314 | 3.6% |
| Other values (80) | 1014874 |
Common
| Value | Count | Frequency (%) |
| 273926 | ||
| , | 71200 | 15.0% |
| . | 22457 | 4.7% |
| [ | 12701 | 2.7% |
| ] | 12699 | 2.7% |
| - | 11570 | 2.4% |
| ' | 8903 | 1.9% |
| 7104 | 1.5% | |
| / | 6607 | 1.4% |
| 0 | 6579 | 1.4% |
| Other values (35) | 40771 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3441460 | |
| None | 6297 | 0.2% |
| Punctuation | 379 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 341144 | 9.9% |
| e | 314678 | 9.1% |
| 273926 | 8.0% | |
| n | 233745 | 6.8% |
| r | 210253 | 6.1% |
| o | 208503 | 6.1% |
| i | 173894 | 5.1% |
| t | 132266 | 3.8% |
| l | 130259 | 3.8% |
| s | 107689 | 3.1% |
| Other values (81) | 1315103 |
None
| Value | Count | Frequency (%) |
| é | 1764 | |
| ö | 718 | |
| ° | 615 | 9.8% |
| ä | 574 | 9.1% |
| â | 465 | 7.4% |
| ü | 379 | 6.0% |
| ë | 338 | 5.4% |
| è | 186 | 3.0% |
| å | 160 | 2.5% |
| Ö | 130 | 2.1% |
| Other values (31) | 968 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 312 | |
| ‘ | 64 | 16.9% |
| … | 3 | 0.8% |
Missing 
| Distinct | 716 |
|---|---|
| Distinct (%) | 27.7% |
| Missing | 288311 |
| Missing (%) | 99.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 27 |
| Mean length | 7.081175106 |
| Min length | 2 |
Unique
| Unique | 421 ? |
|---|---|
| Unique (%) | 16.3% |
Sample
| 1st row | 1700 m. |
|---|---|
| 2nd row | ± 100 Meter |
| 3rd row | ± 100 m |
| 4th row | asc 3000 ft |
| 5th row | 7000' |
| Value | Count | Frequency (%) |
| m | 1564 | |
| meter | 212 | 4.2% |
| ft | 177 | 3.5% |
| ± | 168 | 3.3% |
| 6000 | 137 | 2.7% |
| 7000 | 121 | 2.4% |
| 1000 | 106 | 2.1% |
| 900 | 102 | 2.0% |
| 1800 | 101 | 2.0% |
| 3000 | 101 | 2.0% |
| Other values (358) | 2280 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5678 | |
| 2483 | ||
| m | 1262 | 6.9% |
| 1 | 1022 | 5.6% |
| . | 814 | 4.4% |
| 5 | 685 | 3.7% |
| M | 616 | 3.4% |
| e | 596 | 3.3% |
| ' | 548 | 3.0% |
| 2 | 519 | 2.8% |
| Other values (47) | 4096 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10006 | |
| Lowercase Letter | 3329 | 18.2% |
| Space Separator | 2483 | 13.6% |
| Other Punctuation | 1432 | 7.8% |
| Uppercase Letter | 663 | 3.6% |
| Math Symbol | 202 | 1.1% |
| Dash Punctuation | 194 | 1.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1262 | |
| e | 596 | |
| t | 508 | |
| r | 274 | 8.2% |
| f | 214 | 6.4% |
| a | 97 | 2.9% |
| o | 81 | 2.4% |
| s | 62 | 1.9% |
| z | 33 | 1.0% |
| l | 29 | 0.9% |
| Other values (14) | 173 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 616 | |
| X | 18 | 2.7% |
| F | 9 | 1.4% |
| S | 6 | 0.9% |
| E | 3 | 0.5% |
| H | 3 | 0.5% |
| K | 2 | 0.3% |
| Y | 2 | 0.3% |
| L | 1 | 0.2% |
| V | 1 | 0.2% |
| Other values (2) | 2 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5678 | |
| 1 | 1022 | 10.2% |
| 5 | 685 | 6.8% |
| 2 | 519 | 5.2% |
| 6 | 395 | 3.9% |
| 7 | 387 | 3.9% |
| 8 | 384 | 3.8% |
| 4 | 355 | 3.5% |
| 3 | 345 | 3.4% |
| 9 | 236 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 814 | |
| ' | 548 | |
| , | 66 | 4.6% |
| : | 3 | 0.2% |
| / | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 196 | |
| + | 6 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 2483 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 194 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14327 | |
| Latin | 3992 | 21.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 1262 | |
| M | 616 | |
| e | 596 | |
| t | 508 | |
| r | 274 | 6.9% |
| f | 214 | 5.4% |
| a | 97 | 2.4% |
| o | 81 | 2.0% |
| s | 62 | 1.6% |
| z | 33 | 0.8% |
| Other values (26) | 249 | 6.2% |
Common
| Value | Count | Frequency (%) |
| 0 | 5678 | |
| 2483 | ||
| 1 | 1022 | 7.1% |
| . | 814 | 5.7% |
| 5 | 685 | 4.8% |
| ' | 548 | 3.8% |
| 2 | 519 | 3.6% |
| 6 | 395 | 2.8% |
| 7 | 387 | 2.7% |
| 8 | 384 | 2.7% |
| Other values (11) | 1412 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18121 | |
| None | 198 | 1.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5678 | |
| 2483 | ||
| m | 1262 | 7.0% |
| 1 | 1022 | 5.6% |
| . | 814 | 4.5% |
| 5 | 685 | 3.8% |
| M | 616 | 3.4% |
| e | 596 | 3.3% |
| ' | 548 | 3.0% |
| 2 | 519 | 2.9% |
| Other values (45) | 3898 |
None
| Value | Count | Frequency (%) |
| ± | 196 | |
| ü | 2 | 1.0% |
decimalLatitude
Text
Missing 
| Distinct | 8176 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 139112 |
| Missing (%) | 47.8% |
| Memory size | 2.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 6.188172822 |
| Min length | 3 |
Unique
| Unique | 2575 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | 52.25 |
|---|---|
| 2nd row | -35.8417 |
| 3rd row | 13.5 |
| 4th row | -45.15267 |
| 5th row | -13.4 |
| Value | Count | Frequency (%) |
| 6.7667 | 1821 | 1.2% |
| 52.2417 | 1258 | 0.8% |
| 6.775 | 1114 | 0.7% |
| 6.5833 | 1111 | 0.7% |
| 52.175 | 953 | 0.6% |
| 5.9417 | 878 | 0.6% |
| 3.5917 | 852 | 0.6% |
| 52.1 | 846 | 0.6% |
| 53.3917 | 843 | 0.6% |
| 52.3583 | 813 | 0.5% |
| Other values (7241) | 141297 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 151786 | |
| 5 | 137622 | |
| 3 | 107967 | |
| 1 | 87724 | |
| 2 | 84224 | |
| 7 | 76782 | |
| 8 | 60464 | 6.4% |
| 6 | 56108 | 6.0% |
| 0 | 51478 | 5.5% |
| 4 | 48645 | 5.2% |
| Other values (2) | 76478 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 742295 | |
| Other Punctuation | 151786 | 16.2% |
| Dash Punctuation | 45197 | 4.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 137622 | |
| 3 | 107967 | |
| 1 | 87724 | |
| 2 | 84224 | |
| 7 | 76782 | |
| 8 | 60464 | |
| 6 | 56108 | |
| 0 | 51478 | 6.9% |
| 4 | 48645 | 6.6% |
| 9 | 31281 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 151786 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 45197 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 939278 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 151786 | |
| 5 | 137622 | |
| 3 | 107967 | |
| 1 | 87724 | |
| 2 | 84224 | |
| 7 | 76782 | |
| 8 | 60464 | 6.4% |
| 6 | 56108 | 6.0% |
| 0 | 51478 | 5.5% |
| 4 | 48645 | 5.2% |
| Other values (2) | 76478 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 939278 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 151786 | |
| 5 | 137622 | |
| 3 | 107967 | |
| 1 | 87724 | |
| 2 | 84224 | |
| 7 | 76782 | |
| 8 | 60464 | 6.4% |
| 6 | 56108 | 6.0% |
| 0 | 51478 | 5.5% |
| 4 | 48645 | 5.2% |
| Other values (2) | 76478 |
decimalLongitude
Text
Missing 
| Distinct | 9940 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 139112 |
| Missing (%) | 47.8% |
| Memory size | 2.2 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.289394279 |
| Min length | 3 |
Unique
| Unique | 3493 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | 4.5333 |
|---|---|
| 2nd row | 137.5083 |
| 3rd row | -16.0 |
| 4th row | 169.89263 |
| 5th row | 48.27 |
| Value | Count | Frequency (%) |
| 106.9167 | 1795 | 1.2% |
| 107.0 | 1160 | 0.8% |
| 106.925 | 1135 | 0.7% |
| 106.8 | 1065 | 0.7% |
| 4.875 | 997 | 0.7% |
| 4.425 | 757 | 0.5% |
| 124.8583 | 753 | 0.5% |
| 98.675 | 723 | 0.5% |
| 106.825 | 711 | 0.5% |
| 6.1 | 699 | 0.5% |
| Other values (9112) | 141991 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 151786 | |
| 1 | 122157 | |
| 5 | 102551 | |
| 3 | 90578 | |
| 7 | 85418 | |
| 4 | 73953 | |
| 0 | 73376 | |
| 8 | 73201 | |
| 6 | 63754 | |
| 2 | 51747 | 5.4% |
| Other values (2) | 66121 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 781858 | |
| Other Punctuation | 151786 | 15.9% |
| Dash Punctuation | 20998 | 2.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 122157 | |
| 5 | 102551 | |
| 3 | 90578 | |
| 7 | 85418 | |
| 4 | 73953 | |
| 0 | 73376 | |
| 8 | 73201 | |
| 6 | 63754 | |
| 2 | 51747 | |
| 9 | 45123 | 5.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 151786 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20998 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 954642 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 151786 | |
| 1 | 122157 | |
| 5 | 102551 | |
| 3 | 90578 | |
| 7 | 85418 | |
| 4 | 73953 | |
| 0 | 73376 | |
| 8 | 73201 | |
| 6 | 63754 | |
| 2 | 51747 | 5.4% |
| Other values (2) | 66121 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 954642 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 151786 | |
| 1 | 122157 | |
| 5 | 102551 | |
| 3 | 90578 | |
| 7 | 85418 | |
| 4 | 73953 | |
| 0 | 73376 | |
| 8 | 73201 | |
| 6 | 63754 | |
| 2 | 51747 | 5.4% |
| Other values (2) | 66121 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 173 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 289239 |
| Missing (%) | 99.4% |
| Memory size | 2.2 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.42676311 |
| Min length | 3 |
Unique
| Unique | 73 ? |
|---|---|
| Unique (%) | 4.4% |
Sample
| 1st row | 640000.0 |
|---|---|
| 2nd row | 20000.0 |
| 3rd row | 640000.0 |
| 4th row | 1000.0 |
| 5th row | 1.0 |
| Value | Count | Frequency (%) |
| 5.0 | 399 | |
| 82230.0 | 131 | 7.9% |
| 60697.0 | 87 | 5.2% |
| 100.0 | 71 | 4.3% |
| 216478.0 | 65 | 3.9% |
| 1000.0 | 48 | 2.9% |
| 2000.0 | 47 | 2.8% |
| 200.0 | 41 | 2.5% |
| 5196.0 | 40 | 2.4% |
| 50.0 | 37 | 2.2% |
| Other values (163) | 693 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3474 | |
| . | 1659 | |
| 5 | 693 | 7.7% |
| 2 | 585 | 6.5% |
| 6 | 556 | 6.2% |
| 1 | 437 | 4.9% |
| 7 | 386 | 4.3% |
| 4 | 331 | 3.7% |
| 8 | 315 | 3.5% |
| 3 | 303 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7344 | |
| Other Punctuation | 1659 | 18.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3474 | |
| 5 | 693 | 9.4% |
| 2 | 585 | 8.0% |
| 6 | 556 | 7.6% |
| 1 | 437 | 6.0% |
| 7 | 386 | 5.3% |
| 4 | 331 | 4.5% |
| 8 | 315 | 4.3% |
| 3 | 303 | 4.1% |
| 9 | 264 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1659 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9003 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3474 | |
| . | 1659 | |
| 5 | 693 | 7.7% |
| 2 | 585 | 6.5% |
| 6 | 556 | 6.2% |
| 1 | 437 | 4.9% |
| 7 | 386 | 4.3% |
| 4 | 331 | 3.7% |
| 8 | 315 | 3.5% |
| 3 | 303 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9003 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3474 | |
| . | 1659 | |
| 5 | 693 | 7.7% |
| 2 | 585 | 6.5% |
| 6 | 556 | 6.2% |
| 1 | 437 | 4.9% |
| 7 | 386 | 4.3% |
| 4 | 331 | 3.7% |
| 8 | 315 | 3.5% |
| 3 | 303 | 3.4% |
typeStatus
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 287427 |
| Missing (%) | 98.8% |
| Memory size | 2.2 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.703831749 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SYNTYPE |
|---|---|
| 2nd row | SYNTYPE |
| 3rd row | SYNTYPE |
| 4th row | PARATYPE |
| 5th row | PARATYPE |
| Value | Count | Frequency (%) |
| syntype | 2278 | |
| paratype | 500 | 14.4% |
| holotype | 369 | 10.6% |
| paralectotype | 239 | 6.9% |
| lectotype | 79 | 2.3% |
| type | 6 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 5749 | |
| P | 4210 | |
| T | 3789 | |
| E | 3789 | |
| S | 2278 | 8.5% |
| N | 2278 | 8.5% |
| A | 1478 | 5.5% |
| O | 1056 | 3.9% |
| R | 739 | 2.8% |
| L | 687 | 2.6% |
| Other values (2) | 687 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 26740 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 5749 | |
| P | 4210 | |
| T | 3789 | |
| E | 3789 | |
| S | 2278 | 8.5% |
| N | 2278 | 8.5% |
| A | 1478 | 5.5% |
| O | 1056 | 3.9% |
| R | 739 | 2.8% |
| L | 687 | 2.6% |
| Other values (2) | 687 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26740 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 5749 | |
| P | 4210 | |
| T | 3789 | |
| E | 3789 | |
| S | 2278 | 8.5% |
| N | 2278 | 8.5% |
| A | 1478 | 5.5% |
| O | 1056 | 3.9% |
| R | 739 | 2.8% |
| L | 687 | 2.6% |
| Other values (2) | 687 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26740 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 5749 | |
| P | 4210 | |
| T | 3789 | |
| E | 3789 | |
| S | 2278 | 8.5% |
| N | 2278 | 8.5% |
| A | 1478 | 5.5% |
| O | 1056 | 3.9% |
| R | 739 | 2.8% |
| L | 687 | 2.6% |
| Other values (2) | 687 | 2.6% |
identifiedBy
Text
Missing 
| Distinct | 48 |
|---|---|
| Distinct (%) | 11.7% |
| Missing | 290486 |
| Missing (%) | 99.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 9 |
| Mean length | 9.708737864 |
| Min length | 4 |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | Rijswijk C. van |
|---|---|
| 2nd row | Konter A. |
| 3rd row | Konter A. |
| 4th row | Voous of Wattel? |
| 5th row | Voous |
| Value | Count | Frequency (%) |
| konter | 165 | |
| a | 165 | |
| dekker | 113 | |
| r | 113 | |
| voous | 32 | 3.9% |
| roselaar | 21 | 2.5% |
| jansen | 11 | 1.3% |
| j.f.j | 11 | 1.3% |
| k | 11 | 1.3% |
| of | 9 | 1.1% |
| Other values (72) | 173 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 527 | |
| 412 | ||
| . | 408 | |
| r | 342 | 8.6% |
| o | 283 | 7.1% |
| k | 242 | 6.0% |
| n | 218 | 5.5% |
| t | 206 | 5.1% |
| K | 184 | 4.6% |
| A | 166 | 4.2% |
| Other values (48) | 1012 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2283 | |
| Uppercase Letter | 833 | 20.8% |
| Other Punctuation | 418 | 10.4% |
| Space Separator | 412 | 10.3% |
| Decimal Number | 48 | 1.2% |
| Open Punctuation | 3 | 0.1% |
| Close Punctuation | 3 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 527 | |
| r | 342 | |
| o | 283 | |
| k | 242 | |
| n | 218 | |
| t | 206 | 9.0% |
| a | 108 | 4.7% |
| s | 95 | 4.2% |
| l | 60 | 2.6% |
| u | 41 | 1.8% |
| Other values (13) | 161 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 184 | |
| A | 166 | |
| R | 137 | |
| D | 121 | |
| V | 43 | 5.2% |
| J | 36 | 4.3% |
| P | 22 | 2.6% |
| S | 19 | 2.3% |
| W | 16 | 1.9% |
| C | 15 | 1.8% |
| Other values (11) | 74 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13 | |
| 1 | 11 | |
| 2 | 10 | |
| 3 | 8 | |
| 5 | 2 | 4.2% |
| 9 | 2 | 4.2% |
| 8 | 1 | 2.1% |
| 4 | 1 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 408 | |
| ? | 9 | 2.2% |
| & | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 412 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3116 | |
| Common | 884 | 22.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 527 | |
| r | 342 | |
| o | 283 | |
| k | 242 | 7.8% |
| n | 218 | 7.0% |
| t | 206 | 6.6% |
| K | 184 | 5.9% |
| A | 166 | 5.3% |
| R | 137 | 4.4% |
| D | 121 | 3.9% |
| Other values (34) | 690 |
Common
| Value | Count | Frequency (%) |
| 412 | ||
| . | 408 | |
| 0 | 13 | 1.5% |
| 1 | 11 | 1.2% |
| 2 | 10 | 1.1% |
| ? | 9 | 1.0% |
| 3 | 8 | 0.9% |
| ( | 3 | 0.3% |
| ) | 3 | 0.3% |
| 5 | 2 | 0.2% |
| Other values (4) | 5 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 527 | |
| 412 | ||
| . | 408 | |
| r | 342 | 8.6% |
| o | 283 | 7.1% |
| k | 242 | 6.0% |
| n | 218 | 5.5% |
| t | 206 | 5.1% |
| K | 184 | 4.6% |
| A | 166 | 4.2% |
| Other values (48) | 1012 |
dateIdentified
Text
Missing 
| Distinct | 40 |
|---|---|
| Distinct (%) | 15.6% |
| Missing | 290641 |
| Missing (%) | 99.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 25 ? |
|---|---|
| Unique (%) | 9.7% |
Sample
| 1st row | 2022-07-01T00:00:00 |
|---|---|
| 2nd row | 2022-04-25T00:00:00 |
| 3rd row | 2022-04-25T00:00:00 |
| 4th row | 1964-01-01T00:00:00 |
| 5th row | 2022-04-25T00:00:00 |
| Value | Count | Frequency (%) |
| 2022-04-25t00:00:00 | 165 | |
| 2018-05-31t00:00:00 | 13 | 5.1% |
| 2021-07-01t00:00:00 | 11 | 4.3% |
| 1964-01-01t00:00:00 | 10 | 3.9% |
| 2014-10-28t00:00:00 | 7 | 2.7% |
| 2014-10-20t00:00:00 | 4 | 1.6% |
| 2023-12-28t00:00:00 | 3 | 1.2% |
| 2022-08-31t00:00:00 | 3 | 1.2% |
| 2017-04-17t00:00:00 | 3 | 1.2% |
| 2023-01-01t00:00:00 | 3 | 1.2% |
| Other values (30) | 35 | 13.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2092 | |
| 2 | 814 | 16.7% |
| - | 514 | 10.5% |
| : | 514 | 10.5% |
| T | 257 | 5.3% |
| 4 | 195 | 4.0% |
| 5 | 184 | 3.8% |
| 1 | 175 | 3.6% |
| 8 | 38 | 0.8% |
| 3 | 33 | 0.7% |
| Other values (3) | 67 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3598 | |
| Dash Punctuation | 514 | 10.5% |
| Other Punctuation | 514 | 10.5% |
| Uppercase Letter | 257 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2092 | |
| 2 | 814 | 22.6% |
| 4 | 195 | 5.4% |
| 5 | 184 | 5.1% |
| 1 | 175 | 4.9% |
| 8 | 38 | 1.1% |
| 3 | 33 | 0.9% |
| 7 | 28 | 0.8% |
| 9 | 23 | 0.6% |
| 6 | 16 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 514 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 514 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 257 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4626 | |
| Latin | 257 | 5.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2092 | |
| 2 | 814 | 17.6% |
| - | 514 | 11.1% |
| : | 514 | 11.1% |
| 4 | 195 | 4.2% |
| 5 | 184 | 4.0% |
| 1 | 175 | 3.8% |
| 8 | 38 | 0.8% |
| 3 | 33 | 0.7% |
| 7 | 28 | 0.6% |
| Other values (2) | 39 | 0.8% |
Latin
| Value | Count | Frequency (%) |
| T | 257 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4883 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2092 | |
| 2 | 814 | 16.7% |
| - | 514 | 10.5% |
| : | 514 | 10.5% |
| T | 257 | 5.3% |
| 4 | 195 | 4.0% |
| 5 | 184 | 3.8% |
| 1 | 175 | 3.6% |
| 8 | 38 | 0.8% |
| 3 | 33 | 0.7% |
| Other values (3) | 67 | 1.4% |
| Distinct | 14746 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.988418581 |
| Min length | 1 |
Unique
| Unique | 3066 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 2484620 |
|---|---|
| 2nd row | 7342142 |
| 3rd row | 2479504 |
| 4th row | 6170652 |
| 5th row | 6170887 |
| Value | Count | Frequency (%) |
| 5231191 | 1635 | 0.6% |
| 6172874 | 1489 | 0.5% |
| 6065824 | 1204 | 0.4% |
| 6171845 | 1145 | 0.4% |
| 2480242 | 1135 | 0.4% |
| 7191198 | 1017 | 0.3% |
| 7341902 | 981 | 0.3% |
| 9156140 | 924 | 0.3% |
| 7192432 | 869 | 0.3% |
| 8990910 | 856 | 0.3% |
| Other values (14736) | 279642 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2032910 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2032910 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2032910 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
scientificName
Text
| Distinct | 15605 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 73 |
|---|---|
| Median length | 62 |
| Mean length | 33.46860068 |
| Min length | 4 |
Unique
| Unique | 3323 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | Vidua orientalis Heuglin, 1870 |
|---|---|
| 2nd row | Turdus viscivorus viscivorus |
| 3rd row | Neophema splendida (Gould, 1841) |
| 4th row | Platycercus elegans melanopterus North, 1906 |
| 5th row | Polytelis anthopeplus monarchoides Schodde, 1993 |
| Value | Count | Frequency (%) |
| linnaeus | 44499 | 3.9% |
| 1758 | 35006 | 3.1% |
| temminck | 8993 | 0.8% |
| vieillot | 8822 | 0.8% |
| 1766 | 8577 | 0.8% |
| 8565 | 0.8% | |
| 1789 | 7166 | 0.6% |
| 1821 | 6706 | 0.6% |
| horsfield | 6369 | 0.6% |
| gmelin | 5943 | 0.5% |
| Other values (10019) | 994615 |
Most occurring characters
| Value | Count | Frequency (%) |
| 844363 | 8.7% | |
| a | 843067 | 8.7% |
| i | 697116 | 7.2% |
| s | 650497 | 6.7% |
| e | 563060 | 5.8% |
| r | 541457 | 5.6% |
| u | 521696 | 5.4% |
| n | 513245 | 5.3% |
| l | 460926 | 4.7% |
| o | 452646 | 4.6% |
| Other values (72) | 3647876 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7140818 | |
| Space Separator | 844363 | 8.7% |
| Decimal Number | 755884 | 7.8% |
| Uppercase Letter | 534717 | 5.5% |
| Other Punctuation | 240381 | 2.5% |
| Close Punctuation | 109373 | 1.1% |
| Open Punctuation | 109373 | 1.1% |
| Dash Punctuation | 1039 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 843067 | |
| i | 697116 | |
| s | 650497 | |
| e | 563060 | 7.9% |
| r | 541457 | 7.6% |
| u | 521696 | 7.3% |
| n | 513245 | 7.2% |
| l | 460926 | 6.5% |
| o | 452646 | 6.3% |
| c | 359670 | 5.0% |
| Other values (25) | 1537438 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 78527 | |
| P | 55836 | |
| C | 49872 | |
| S | 46252 | 8.6% |
| A | 38918 | 7.3% |
| G | 32535 | 6.1% |
| T | 32243 | 6.0% |
| M | 27609 | 5.2% |
| B | 26545 | 5.0% |
| H | 26230 | 4.9% |
| Other values (17) | 120150 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 228842 | |
| 8 | 165376 | |
| 7 | 99308 | |
| 5 | 54960 | 7.3% |
| 9 | 44245 | 5.9% |
| 6 | 43057 | 5.7% |
| 2 | 36834 | 4.9% |
| 3 | 33830 | 4.5% |
| 0 | 26848 | 3.6% |
| 4 | 22584 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 188988 | |
| . | 42650 | 17.7% |
| & | 8562 | 3.6% |
| ' | 178 | 0.1% |
| ? | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 844363 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 109373 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 109373 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1039 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7675535 | |
| Common | 2060414 | 21.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 843067 | |
| i | 697116 | 9.1% |
| s | 650497 | 8.5% |
| e | 563060 | 7.3% |
| r | 541457 | 7.1% |
| u | 521696 | 6.8% |
| n | 513245 | 6.7% |
| l | 460926 | 6.0% |
| o | 452646 | 5.9% |
| c | 359670 | 4.7% |
| Other values (52) | 2072155 |
Common
| Value | Count | Frequency (%) |
| 844363 | ||
| 1 | 228842 | 11.1% |
| , | 188988 | 9.2% |
| 8 | 165376 | 8.0% |
| ) | 109373 | 5.3% |
| ( | 109373 | 5.3% |
| 7 | 99308 | 4.8% |
| 5 | 54960 | 2.7% |
| 9 | 44245 | 2.1% |
| 6 | 43057 | 2.1% |
| Other values (10) | 172529 | 8.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9732344 | |
| None | 3605 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 844363 | 8.7% | |
| a | 843067 | 8.7% |
| i | 697116 | 7.2% |
| s | 650497 | 6.7% |
| e | 563060 | 5.8% |
| r | 541457 | 5.6% |
| u | 521696 | 5.4% |
| n | 513245 | 5.3% |
| l | 460926 | 4.7% |
| o | 452646 | 4.7% |
| Other values (61) | 3644271 |
None
| Value | Count | Frequency (%) |
| ü | 2204 | |
| ø | 470 | 13.0% |
| é | 383 | 10.6% |
| ä | 257 | 7.1% |
| á | 181 | 5.0% |
| è | 40 | 1.1% |
| É | 38 | 1.1% |
| ö | 14 | 0.4% |
| ë | 12 | 0.3% |
| ñ | 5 | 0.1% |
| Distinct | 310 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 43 |
| Mean length | 16.58477886 |
| Min length | 8 |
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia|Viduidae |
|---|---|
| 2nd row | Animalia|Turdidae |
| 3rd row | Animalia|Psittacidae |
| 4th row | Animalia|Psittacidae |
| 5th row | Animalia|Psittacidae |
| Value | Count | Frequency (%) |
| animalia | 74175 | |
| animalia|turdidae | 13154 | 4.5% |
| animalia|scolopacidae | 11012 | 3.8% |
| animalia|sylviidae | 10286 | 3.5% |
| animalia|emberizidae | 8024 | 2.8% |
| animalia|fringillidae | 7443 | 2.6% |
| animalia|corvidae | 7140 | 2.4% |
| animalia|ardeidae | 5218 | 1.8% |
| animalia|timaliidae | 5010 | 1.7% |
| animalia|charadriidae | 4907 | 1.7% |
| Other values (298) | 145238 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 936776 | |
| a | 917961 | |
| l | 393768 | |
| n | 357847 | 7.4% |
| m | 319003 | 6.6% |
| A | 316570 | 6.6% |
| e | 276152 | 5.7% |
| d | 261177 | 5.4% |
| | | 221132 | 4.6% |
| r | 138033 | 2.9% |
| Other values (42) | 686060 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4088471 | |
| Uppercase Letter | 513434 | 10.6% |
| Math Symbol | 221132 | 4.6% |
| Other Punctuation | 733 | < 0.1% |
| Space Separator | 709 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 936776 | |
| a | 917961 | |
| l | 393768 | |
| n | 357847 | 8.8% |
| m | 319003 | 7.8% |
| e | 276152 | 6.8% |
| d | 261177 | 6.4% |
| r | 138033 | 3.4% |
| c | 98653 | 2.4% |
| o | 93715 | 2.3% |
| Other values (13) | 295386 | 7.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 316570 | |
| P | 35114 | 6.8% |
| T | 32092 | 6.3% |
| S | 31063 | 6.1% |
| C | 24935 | 4.9% |
| M | 14619 | 2.8% |
| E | 13061 | 2.5% |
| F | 11340 | 2.2% |
| L | 6846 | 1.3% |
| N | 4455 | 0.9% |
| Other values (12) | 23339 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 679 | |
| ? | 39 | 5.3% |
| / | 12 | 1.6% |
| , | 2 | 0.3% |
| . | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 221132 |
Space Separator
| Value | Count | Frequency (%) |
| 709 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4601905 | |
| Common | 222574 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 936776 | |
| a | 917961 | |
| l | 393768 | |
| n | 357847 | 7.8% |
| m | 319003 | 6.9% |
| A | 316570 | 6.9% |
| e | 276152 | 6.0% |
| d | 261177 | 5.7% |
| r | 138033 | 3.0% |
| c | 98653 | 2.1% |
| Other values (35) | 585965 |
Common
| Value | Count | Frequency (%) |
| | | 221132 | |
| 709 | 0.3% | |
| : | 679 | 0.3% |
| ? | 39 | < 0.1% |
| / | 12 | < 0.1% |
| , | 2 | < 0.1% |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4824479 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 936776 | |
| a | 917961 | |
| l | 393768 | |
| n | 357847 | 7.4% |
| m | 319003 | 6.6% |
| A | 316570 | 6.6% |
| e | 276152 | 5.7% |
| d | 261177 | 5.4% |
| | | 221132 | 4.6% |
| r | 138033 | 2.9% |
| Other values (42) | 686060 |
kingdom
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.000020626 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 290897 | |
| incertae | 1 | < 0.1% |
| sedis | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 581796 | |
| a | 581795 | |
| n | 290898 | |
| A | 290897 | |
| m | 290897 | |
| l | 290897 | |
| e | 3 | < 0.1% |
| s | 2 | < 0.1% |
| c | 1 | < 0.1% |
| r | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2036292 | |
| Uppercase Letter | 290897 | 12.5% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 581796 | |
| a | 581795 | |
| n | 290898 | |
| m | 290897 | |
| l | 290897 | |
| e | 3 | < 0.1% |
| s | 2 | < 0.1% |
| c | 1 | < 0.1% |
| r | 1 | < 0.1% |
| t | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 290897 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2327189 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 581796 | |
| a | 581795 | |
| n | 290898 | |
| A | 290897 | |
| m | 290897 | |
| l | 290897 | |
| e | 3 | < 0.1% |
| s | 2 | < 0.1% |
| c | 1 | < 0.1% |
| r | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2327190 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 581796 | |
| a | 581795 | |
| n | 290898 | |
| A | 290897 | |
| m | 290897 | |
| l | 290897 | |
| e | 3 | < 0.1% |
| s | 2 | < 0.1% |
| c | 1 | < 0.1% |
| r | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
phylum
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 766 |
| Missing (%) | 0.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.000461859 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 290063 | |
| arthropoda | 67 | < 0.1% |
| mollusca | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 580195 | |
| o | 290199 | |
| r | 290197 | |
| h | 290130 | |
| d | 290130 | |
| t | 290130 | |
| C | 290063 | |
| A | 67 | < 0.1% |
| p | 67 | < 0.1% |
| l | 4 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2031058 | |
| Uppercase Letter | 290132 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 580195 | |
| o | 290199 | |
| r | 290197 | |
| h | 290130 | |
| d | 290130 | |
| t | 290130 | |
| p | 67 | < 0.1% |
| l | 4 | < 0.1% |
| u | 2 | < 0.1% |
| s | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 290063 | |
| A | 67 | < 0.1% |
| M | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2321190 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 580195 | |
| o | 290199 | |
| r | 290197 | |
| h | 290130 | |
| d | 290130 | |
| t | 290130 | |
| C | 290063 | |
| A | 67 | < 0.1% |
| p | 67 | < 0.1% |
| l | 4 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2321190 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 580195 | |
| o | 290199 | |
| r | 290197 | |
| h | 290130 | |
| d | 290130 | |
| t | 290130 | |
| C | 290063 | |
| A | 67 | < 0.1% |
| p | 67 | < 0.1% |
| l | 4 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
class
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 770 |
| Missing (%) | 0.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 4 |
| Mean length | 4.000820328 |
| Min length | 4 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Aves |
|---|---|
| 2nd row | Aves |
| 3rd row | Aves |
| 4th row | Aves |
| 5th row | Aves |
| Value | Count | Frequency (%) |
| aves | 290053 | |
| insecta | 66 | < 0.1% |
| mammalia | 6 | < 0.1% |
| bivalvia | 1 | < 0.1% |
| squamata | 1 | < 0.1% |
| malacostraca | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 290120 | |
| e | 290119 | |
| v | 290055 | |
| A | 290053 | |
| a | 93 | < 0.1% |
| c | 68 | < 0.1% |
| t | 68 | < 0.1% |
| I | 66 | < 0.1% |
| n | 66 | < 0.1% |
| m | 13 | < 0.1% |
| Other values (9) | 29 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 870622 | |
| Uppercase Letter | 290128 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 290120 | |
| e | 290119 | |
| v | 290055 | |
| a | 93 | < 0.1% |
| c | 68 | < 0.1% |
| t | 68 | < 0.1% |
| n | 66 | < 0.1% |
| m | 13 | < 0.1% |
| i | 8 | < 0.1% |
| l | 8 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 290053 | |
| I | 66 | < 0.1% |
| M | 7 | < 0.1% |
| B | 1 | < 0.1% |
| S | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1160750 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 290120 | |
| e | 290119 | |
| v | 290055 | |
| A | 290053 | |
| a | 93 | < 0.1% |
| c | 68 | < 0.1% |
| t | 68 | < 0.1% |
| I | 66 | < 0.1% |
| n | 66 | < 0.1% |
| m | 13 | < 0.1% |
| Other values (9) | 29 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1160750 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 290120 | |
| e | 290119 | |
| v | 290055 | |
| A | 290053 | |
| a | 93 | < 0.1% |
| c | 68 | < 0.1% |
| t | 68 | < 0.1% |
| I | 66 | < 0.1% |
| n | 66 | < 0.1% |
| m | 13 | < 0.1% |
| Other values (9) | 29 | < 0.1% |
order
Text
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1492 |
| Missing (%) | 0.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 13 |
| Mean length | 13.08270388 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Passeriformes |
|---|---|
| 2nd row | Passeriformes |
| 3rd row | Psittaciformes |
| 4th row | Psittaciformes |
| 5th row | Psittaciformes |
| Value | Count | Frequency (%) |
| passeriformes | 145670 | |
| charadriiformes | 33385 | 11.5% |
| accipitriformes | 10340 | 3.6% |
| anseriformes | 10163 | 3.5% |
| columbiformes | 9902 | 3.4% |
| piciformes | 8355 | 2.9% |
| galliformes | 7462 | 2.6% |
| apodiformes | 7409 | 2.6% |
| pelecaniformes | 7141 | 2.5% |
| coraciiformes | 7019 | 2.4% |
| Other values (43) | 42560 | 14.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 598367 | |
| r | 550123 | |
| e | 466963 | |
| i | 383264 | |
| o | 327105 | |
| m | 301232 | |
| f | 289334 | |
| a | 250700 | |
| P | 173212 | 4.6% |
| c | 67656 | 1.8% |
| Other values (26) | 378257 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3496807 | |
| Uppercase Letter | 289406 | 7.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 598367 | |
| r | 550123 | |
| e | 466963 | |
| i | 383264 | |
| o | 327105 | |
| m | 301232 | |
| f | 289334 | |
| a | 250700 | |
| c | 67656 | 1.9% |
| l | 50713 | 1.5% |
| Other values (10) | 211350 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 173212 | |
| C | 58998 | 20.4% |
| A | 27958 | 9.7% |
| G | 14500 | 5.0% |
| S | 7509 | 2.6% |
| F | 4075 | 1.4% |
| B | 1360 | 0.5% |
| T | 1048 | 0.4% |
| O | 266 | 0.1% |
| M | 226 | 0.1% |
| Other values (6) | 254 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3786213 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 598367 | |
| r | 550123 | |
| e | 466963 | |
| i | 383264 | |
| o | 327105 | |
| m | 301232 | |
| f | 289334 | |
| a | 250700 | |
| P | 173212 | 4.6% |
| c | 67656 | 1.8% |
| Other values (26) | 378257 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3786213 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 598367 | |
| r | 550123 | |
| e | 466963 | |
| i | 383264 | |
| o | 327105 | |
| m | 301232 | |
| f | 289334 | |
| a | 250700 | |
| P | 173212 | 4.6% |
| c | 67656 | 1.8% |
| Other values (26) | 378257 |
family
Text
| Distinct | 250 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1542 |
| Missing (%) | 0.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 10.45621311 |
| Min length | 7 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Viduidae |
|---|---|
| 2nd row | Turdidae |
| 3rd row | Psittacidae |
| 4th row | Psittacidae |
| 5th row | Psittacidae |
| Value | Count | Frequency (%) |
| scolopacidae | 11999 | 4.1% |
| muscicapidae | 11304 | 3.9% |
| anatidae | 10104 | 3.5% |
| accipitridae | 9920 | 3.4% |
| columbidae | 9902 | 3.4% |
| laridae | 9646 | 3.3% |
| fringillidae | 8978 | 3.1% |
| corvidae | 7653 | 2.6% |
| turdidae | 6987 | 2.4% |
| psittacidae | 6737 | 2.3% |
| Other values (240) | 196126 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 466305 | |
| a | 457770 | |
| e | 358881 | |
| d | 337256 | |
| r | 160715 | 5.3% |
| c | 157588 | 5.2% |
| l | 139993 | 4.6% |
| o | 124575 | 4.1% |
| t | 93283 | 3.1% |
| n | 90718 | 3.0% |
| Other values (32) | 638484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2736212 | |
| Uppercase Letter | 289356 | 9.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 466305 | |
| a | 457770 | |
| e | 358881 | |
| d | 337256 | |
| r | 160715 | 5.9% |
| c | 157588 | 5.8% |
| l | 139993 | 5.1% |
| o | 124575 | 4.6% |
| t | 93283 | 3.4% |
| n | 90718 | 3.3% |
| Other values (11) | 349128 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 52163 | |
| A | 42928 | |
| C | 41817 | |
| T | 29194 | |
| S | 26989 | |
| M | 24976 | |
| L | 14587 | 5.0% |
| F | 14496 | 5.0% |
| R | 9279 | 3.2% |
| E | 7381 | 2.6% |
| Other values (11) | 25546 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3025568 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 466305 | |
| a | 457770 | |
| e | 358881 | |
| d | 337256 | |
| r | 160715 | 5.3% |
| c | 157588 | 5.2% |
| l | 139993 | 4.6% |
| o | 124575 | 4.1% |
| t | 93283 | 3.1% |
| n | 90718 | 3.0% |
| Other values (32) | 638484 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3025568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 466305 | |
| a | 457770 | |
| e | 358881 | |
| d | 337256 | |
| r | 160715 | 5.3% |
| c | 157588 | 5.2% |
| l | 139993 | 4.6% |
| o | 124575 | 4.1% |
| t | 93283 | 3.1% |
| n | 90718 | 3.0% |
| Other values (32) | 638484 |
genus
Text
| Distinct | 2192 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1404 |
| Missing (%) | 0.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 8.287798711 |
| Min length | 3 |
Unique
| Unique | 157 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Vidua |
|---|---|
| 2nd row | Turdus |
| 3rd row | Neophema |
| 4th row | Platycercus |
| 5th row | Polytelis |
| Value | Count | Frequency (%) |
| turdus | 5647 | 2.0% |
| calidris | 3893 | 1.3% |
| falco | 3593 | 1.2% |
| pycnonotus | 3364 | 1.2% |
| passer | 3110 | 1.1% |
| accipiter | 2980 | 1.0% |
| sylvia | 2742 | 0.9% |
| emberiza | 2716 | 0.9% |
| larus | 2634 | 0.9% |
| corvus | 2475 | 0.9% |
| Other values (2182) | 256340 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 243380 | 10.1% |
| s | 190746 | 8.0% |
| i | 188999 | 7.9% |
| r | 183484 | 7.6% |
| o | 182529 | 7.6% |
| u | 165872 | 6.9% |
| l | 135576 | 5.7% |
| e | 130021 | 5.4% |
| c | 113946 | 4.7% |
| n | 107840 | 4.5% |
| Other values (42) | 756875 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2109774 | |
| Uppercase Letter | 289494 | 12.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 243380 | |
| s | 190746 | |
| i | 188999 | |
| r | 183484 | 8.7% |
| o | 182529 | 8.7% |
| u | 165872 | 7.9% |
| l | 135576 | 6.4% |
| e | 130021 | 6.2% |
| c | 113946 | 5.4% |
| n | 107840 | 5.1% |
| Other values (16) | 467381 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 46208 | |
| C | 45802 | |
| A | 32702 | |
| T | 22547 | |
| S | 22237 | |
| M | 17848 | 6.2% |
| L | 17759 | 6.1% |
| G | 11587 | 4.0% |
| E | 10390 | 3.6% |
| F | 9152 | 3.2% |
| Other values (16) | 53262 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2399268 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 243380 | 10.1% |
| s | 190746 | 8.0% |
| i | 188999 | 7.9% |
| r | 183484 | 7.6% |
| o | 182529 | 7.6% |
| u | 165872 | 6.9% |
| l | 135576 | 5.7% |
| e | 130021 | 5.4% |
| c | 113946 | 4.7% |
| n | 107840 | 4.5% |
| Other values (42) | 756875 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2399268 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 243380 | 10.1% |
| s | 190746 | 8.0% |
| i | 188999 | 7.9% |
| r | 183484 | 7.6% |
| o | 182529 | 7.6% |
| u | 165872 | 6.9% |
| l | 135576 | 5.7% |
| e | 130021 | 5.4% |
| c | 113946 | 4.7% |
| n | 107840 | 4.5% |
| Other values (42) | 756875 |
genericName
Text
| Distinct | 2287 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1636 |
| Missing (%) | 0.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 8.139814424 |
| Min length | 1 |
Unique
| Unique | 199 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Vidua |
|---|---|
| 2nd row | Turdus |
| 3rd row | Neophema |
| 4th row | Platycercus |
| 5th row | Polytelis |
| Value | Count | Frequency (%) |
| turdus | 5646 | 2.0% |
| larus | 4361 | 1.5% |
| falco | 3593 | 1.2% |
| parus | 3587 | 1.2% |
| corvus | 3377 | 1.2% |
| pycnonotus | 3358 | 1.2% |
| sterna | 3238 | 1.1% |
| passer | 3110 | 1.1% |
| anas | 2998 | 1.0% |
| accipiter | 2980 | 1.0% |
| Other values (2277) | 253014 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 248614 | 10.6% |
| r | 185008 | 7.9% |
| s | 182957 | 7.8% |
| i | 178481 | 7.6% |
| o | 170638 | 7.2% |
| u | 166953 | 7.1% |
| e | 131390 | 5.6% |
| l | 131371 | 5.6% |
| c | 112832 | 4.8% |
| t | 105857 | 4.5% |
| Other values (44) | 740438 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2065277 | |
| Uppercase Letter | 289259 | 12.3% |
| Other Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 248614 | |
| r | 185008 | |
| s | 182957 | |
| i | 178481 | 8.6% |
| o | 170638 | 8.3% |
| u | 166953 | 8.1% |
| e | 131390 | 6.4% |
| l | 131371 | 6.4% |
| c | 112832 | 5.5% |
| t | 105857 | 5.1% |
| Other values (17) | 451176 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 46469 | |
| C | 41880 | |
| A | 34789 | |
| S | 21995 | |
| T | 20919 | 7.2% |
| M | 18710 | 6.5% |
| L | 17139 | 5.9% |
| E | 12134 | 4.2% |
| D | 10202 | 3.5% |
| H | 10049 | 3.5% |
| Other values (16) | 54973 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2354536 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 248614 | 10.6% |
| r | 185008 | 7.9% |
| s | 182957 | 7.8% |
| i | 178481 | 7.6% |
| o | 170638 | 7.2% |
| u | 166953 | 7.1% |
| e | 131390 | 5.6% |
| l | 131371 | 5.6% |
| c | 112832 | 4.8% |
| t | 105857 | 4.5% |
| Other values (43) | 740435 |
Common
| Value | Count | Frequency (%) |
| ? | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2354527 | |
| None | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 248614 | 10.6% |
| r | 185008 | 7.9% |
| s | 182957 | 7.8% |
| i | 178481 | 7.6% |
| o | 170638 | 7.2% |
| u | 166953 | 7.1% |
| e | 131390 | 5.6% |
| l | 131371 | 5.6% |
| c | 112832 | 4.8% |
| t | 105857 | 4.5% |
| Other values (43) | 740426 |
None
| Value | Count | Frequency (%) |
| ë | 12 |
specificEpithet
Text
Missing 
| Distinct | 4206 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 10799 |
| Missing (%) | 3.7% |
| Memory size | 2.2 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 8.536481744 |
| Min length | 3 |
Unique
| Unique | 404 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | orientalis |
|---|---|
| 2nd row | viscivorus |
| 3rd row | splendida |
| 4th row | elegans |
| 5th row | anthopeplus |
| Value | Count | Frequency (%) |
| alba | 2049 | 0.7% |
| major | 1951 | 0.7% |
| domesticus | 1907 | 0.7% |
| cinerea | 1808 | 0.6% |
| vulgaris | 1734 | 0.6% |
| montanus | 1697 | 0.6% |
| chloris | 1575 | 0.6% |
| striata | 1540 | 0.5% |
| chinensis | 1514 | 0.5% |
| cristatus | 1484 | 0.5% |
| Other values (4196) | 262840 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 296714 | |
| i | 234928 | |
| s | 232760 | |
| u | 185091 | 7.7% |
| r | 177046 | 7.4% |
| e | 163697 | 6.8% |
| l | 154142 | 6.4% |
| n | 147702 | 6.2% |
| c | 139563 | 5.8% |
| o | 136297 | 5.7% |
| Other values (16) | 523120 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2391060 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 296714 | |
| i | 234928 | |
| s | 232760 | |
| u | 185091 | 7.7% |
| r | 177046 | 7.4% |
| e | 163697 | 6.8% |
| l | 154142 | 6.4% |
| n | 147702 | 6.2% |
| c | 139563 | 5.8% |
| o | 136297 | 5.7% |
| Other values (16) | 523120 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2391060 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 296714 | |
| i | 234928 | |
| s | 232760 | |
| u | 185091 | 7.7% |
| r | 177046 | 7.4% |
| e | 163697 | 6.8% |
| l | 154142 | 6.4% |
| n | 147702 | 6.2% |
| c | 139563 | 5.8% |
| o | 136297 | 5.7% |
| Other values (16) | 523120 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2391060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 296714 | |
| i | 234928 | |
| s | 232760 | |
| u | 185091 | 7.7% |
| r | 177046 | 7.4% |
| e | 163697 | 6.8% |
| l | 154142 | 6.4% |
| n | 147702 | 6.2% |
| c | 139563 | 5.8% |
| o | 136297 | 5.7% |
| Other values (16) | 523120 |
Missing 
| Distinct | 5180 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 125699 |
| Missing (%) | 43.2% |
| Memory size | 2.2 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 8.541080757 |
| Min length | 3 |
Unique
| Unique | 832 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | viscivorus |
|---|---|
| 2nd row | melanopterus |
| 3rd row | monarchoides |
| 4th row | rubescens |
| 5th row | meridionalis |
| Value | Count | Frequency (%) |
| domesticus | 2283 | 1.4% |
| vulgaris | 1490 | 0.9% |
| merula | 1145 | 0.7% |
| cinerea | 1131 | 0.7% |
| nisus | 1017 | 0.6% |
| glandarius | 981 | 0.6% |
| montanus | 946 | 0.6% |
| coelebs | 924 | 0.6% |
| major | 924 | 0.6% |
| cristatus | 879 | 0.5% |
| Other values (5170) | 153479 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 165124 | |
| i | 150040 | |
| s | 140413 | |
| r | 106309 | 7.5% |
| u | 103801 | 7.4% |
| e | 102944 | 7.3% |
| n | 92129 | 6.5% |
| l | 86050 | 6.1% |
| o | 79489 | 5.6% |
| c | 77292 | 5.5% |
| Other values (16) | 307387 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1410978 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 165124 | |
| i | 150040 | |
| s | 140413 | |
| r | 106309 | 7.5% |
| u | 103801 | 7.4% |
| e | 102944 | 7.3% |
| n | 92129 | 6.5% |
| l | 86050 | 6.1% |
| o | 79489 | 5.6% |
| c | 77292 | 5.5% |
| Other values (16) | 307387 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1410978 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 165124 | |
| i | 150040 | |
| s | 140413 | |
| r | 106309 | 7.5% |
| u | 103801 | 7.4% |
| e | 102944 | 7.3% |
| n | 92129 | 6.5% |
| l | 86050 | 6.1% |
| o | 79489 | 5.6% |
| c | 77292 | 5.5% |
| Other values (16) | 307387 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1410978 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 165124 | |
| i | 150040 | |
| s | 140413 | |
| r | 106309 | 7.5% |
| u | 103801 | 7.4% |
| e | 102944 | 7.3% |
| n | 92129 | 6.5% |
| l | 86050 | 6.1% |
| o | 79489 | 5.6% |
| c | 77292 | 5.5% |
| Other values (16) | 307387 |
taxonRank
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.635483915 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SUBSPECIES |
| 3rd row | SPECIES |
| 4th row | SUBSPECIES |
| 5th row | SUBSPECIES |
| Value | Count | Frequency (%) |
| subspecies | 165197 | |
| species | 115131 | |
| genus | 9163 | 3.1% |
| class | 538 | 0.2% |
| kingdom | 486 | 0.2% |
| family | 329 | 0.1% |
| order | 47 | < 0.1% |
| unranked | 3 | < 0.1% |
| form | 3 | < 0.1% |
| phylum | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 736092 | |
| E | 569869 | |
| I | 281143 | 11.2% |
| C | 280866 | 11.2% |
| P | 280329 | 11.2% |
| U | 174364 | 6.9% |
| B | 165197 | 6.6% |
| N | 9655 | 0.4% |
| G | 9649 | 0.4% |
| A | 870 | < 0.1% |
| Other values (9) | 4011 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2512045 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 736092 | |
| E | 569869 | |
| I | 281143 | 11.2% |
| C | 280866 | 11.2% |
| P | 280329 | 11.2% |
| U | 174364 | 6.9% |
| B | 165197 | 6.6% |
| N | 9655 | 0.4% |
| G | 9649 | 0.4% |
| A | 870 | < 0.1% |
| Other values (9) | 4011 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2512045 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 736092 | |
| E | 569869 | |
| I | 281143 | 11.2% |
| C | 280866 | 11.2% |
| P | 280329 | 11.2% |
| U | 174364 | 6.9% |
| B | 165197 | 6.6% |
| N | 9655 | 0.4% |
| G | 9649 | 0.4% |
| A | 870 | < 0.1% |
| Other values (9) | 4011 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2512045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 736092 | |
| E | 569869 | |
| I | 281143 | 11.2% |
| C | 280866 | 11.2% |
| P | 280329 | 11.2% |
| U | 174364 | 6.9% |
| B | 165197 | 6.6% |
| N | 9655 | 0.4% |
| G | 9649 | 0.4% |
| A | 870 | < 0.1% |
| Other values (9) | 4011 | 0.2% |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ICZN |
|---|---|
| 2nd row | ICZN |
| 3rd row | ICZN |
| 4th row | ICZN |
| 5th row | ICZN |
| Value | Count | Frequency (%) |
| iczn | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 290898 | |
| C | 290898 | |
| Z | 290898 | |
| N | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1163592 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 290898 | |
| C | 290898 | |
| Z | 290898 | |
| N | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1163592 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 290898 | |
| C | 290898 | |
| Z | 290898 | |
| N | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1163592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 290898 | |
| C | 290898 | |
| Z | 290898 | |
| N | 290898 |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.832105522 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 239957 | |
| synonym | 48840 | 16.8% |
| doubtful | 2100 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 479914 | |
| E | 479914 | |
| T | 242057 | |
| D | 242057 | |
| A | 239957 | |
| P | 239957 | |
| Y | 97680 | 4.3% |
| N | 97680 | 4.3% |
| O | 50940 | 2.2% |
| S | 48840 | 2.1% |
| Other values (5) | 59340 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2278336 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 479914 | |
| E | 479914 | |
| T | 242057 | |
| D | 242057 | |
| A | 239957 | |
| P | 239957 | |
| Y | 97680 | 4.3% |
| N | 97680 | 4.3% |
| O | 50940 | 2.2% |
| S | 48840 | 2.1% |
| Other values (5) | 59340 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2278336 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 479914 | |
| E | 479914 | |
| T | 242057 | |
| D | 242057 | |
| A | 239957 | |
| P | 239957 | |
| Y | 97680 | 4.3% |
| N | 97680 | 4.3% |
| O | 50940 | 2.2% |
| S | 48840 | 2.1% |
| Other values (5) | 59340 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2278336 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 479914 | |
| E | 479914 | |
| T | 242057 | |
| D | 242057 | |
| A | 239957 | |
| P | 239957 | |
| Y | 97680 | 4.3% |
| N | 97680 | 4.3% |
| O | 50940 | 2.2% |
| S | 48840 | 2.1% |
| Other values (5) | 59340 | 2.6% |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 889c91a3-614f-4355-8df8-b6d0260a118c |
|---|---|
| 2nd row | 889c91a3-614f-4355-8df8-b6d0260a118c |
| 3rd row | 889c91a3-614f-4355-8df8-b6d0260a118c |
| 4th row | 889c91a3-614f-4355-8df8-b6d0260a118c |
| 5th row | 889c91a3-614f-4355-8df8-b6d0260a118c |
| Value | Count | Frequency (%) |
| 889c91a3-614f-4355-8df8-b6d0260a118c | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 1454490 | |
| 1 | 1163592 | |
| - | 1163592 | |
| 6 | 872694 | 8.3% |
| 9 | 581796 | 5.6% |
| c | 581796 | 5.6% |
| a | 581796 | 5.6% |
| 3 | 581796 | 5.6% |
| 4 | 581796 | 5.6% |
| f | 581796 | 5.6% |
| Other values (5) | 2327184 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6690654 | |
| Lowercase Letter | 2618082 | 25.0% |
| Dash Punctuation | 1163592 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1454490 | |
| 1 | 1163592 | |
| 6 | 872694 | |
| 9 | 581796 | 8.7% |
| 3 | 581796 | 8.7% |
| 4 | 581796 | 8.7% |
| 5 | 581796 | 8.7% |
| 0 | 581796 | 8.7% |
| 2 | 290898 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 581796 | |
| a | 581796 | |
| f | 581796 | |
| d | 581796 | |
| b | 290898 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1163592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7854246 | |
| Latin | 2618082 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 1454490 | |
| 1 | 1163592 | |
| - | 1163592 | |
| 6 | 872694 | |
| 9 | 581796 | 7.4% |
| 3 | 581796 | 7.4% |
| 4 | 581796 | 7.4% |
| 5 | 581796 | 7.4% |
| 0 | 581796 | 7.4% |
| 2 | 290898 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| c | 581796 | |
| a | 581796 | |
| f | 581796 | |
| d | 581796 | |
| b | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10472328 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 1454490 | |
| 1 | 1163592 | |
| - | 1163592 | |
| 6 | 872694 | 8.3% |
| 9 | 581796 | 5.6% |
| c | 581796 | 5.6% |
| a | 581796 | 5.6% |
| 3 | 581796 | 5.6% |
| 4 | 581796 | 5.6% |
| f | 581796 | 5.6% |
| Other values (5) | 2327184 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NL |
|---|---|
| 2nd row | NL |
| 3rd row | NL |
| 4th row | NL |
| 5th row | NL |
| Value | Count | Frequency (%) |
| nl | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 290898 | |
| L | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 581796 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 290898 | |
| L | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 581796 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 290898 | |
| L | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 581796 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 290898 | |
| L | 290898 |
lastInterpreted
Text
| Distinct | 24995 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99487105 |
| Min length | 20 |
Unique
| Unique | 2133 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 2025-01-03T11:41:38.952Z |
|---|---|
| 2nd row | 2025-01-03T11:41:39.036Z |
| 3rd row | 2025-01-03T11:41:41.369Z |
| 4th row | 2025-01-03T11:41:41.370Z |
| 5th row | 2025-01-03T11:41:41.379Z |
| Value | Count | Frequency (%) |
| 2025-01-03t11:42:05.126z | 149 | 0.1% |
| 2025-01-03t11:42:05.124z | 149 | 0.1% |
| 2025-01-03t11:42:05.005z | 148 | 0.1% |
| 2025-01-03t11:42:05.125z | 146 | 0.1% |
| 2025-01-03t11:42:05.127z | 145 | < 0.1% |
| 2025-01-03t11:42:05.122z | 145 | < 0.1% |
| 2025-01-03t11:42:05.010z | 142 | < 0.1% |
| 2025-01-03t11:42:04.999z | 141 | < 0.1% |
| 2025-01-03t11:42:05.042z | 139 | < 0.1% |
| 2025-01-03t11:42:04.998z | 138 | < 0.1% |
| Other values (24985) | 289456 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| - | 581796 | |
| : | 581796 | |
| 5 | 506791 | |
| 4 | 439261 | 6.3% |
| 3 | 387300 | 5.5% |
| T | 290898 | 4.2% |
| Z | 290898 | 4.2% |
| Other values (5) | 748367 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4944147 | |
| Other Punctuation | 872321 | 12.5% |
| Dash Punctuation | 581796 | 8.3% |
| Uppercase Letter | 581796 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| 5 | 506791 | |
| 4 | 439261 | 8.9% |
| 3 | 387300 | 7.8% |
| 8 | 122218 | 2.5% |
| 6 | 117907 | 2.4% |
| 9 | 114950 | 2.3% |
| 7 | 102767 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 581796 | |
| . | 290525 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6398264 | |
| Latin | 581796 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| - | 581796 | |
| : | 581796 | |
| 5 | 506791 | |
| 4 | 439261 | 6.9% |
| 3 | 387300 | 6.1% |
| . | 290525 | 4.5% |
| 8 | 122218 | 1.9% |
| Other values (3) | 335624 | 5.2% |
Latin
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6980060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| - | 581796 | |
| : | 581796 | |
| 5 | 506791 | |
| 4 | 439261 | 6.3% |
| 3 | 387300 | 5.5% |
| T | 290898 | 4.2% |
| Z | 290898 | 4.2% |
| Other values (5) | 748367 |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 227 |
|---|---|
| Distinct (%) | 13.7% |
| Missing | 289238 |
| Missing (%) | 99.4% |
| Memory size | 2.2 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 16.78433735 |
| Min length | 3 |
Unique
| Unique | 91 ? |
|---|---|
| Unique (%) | 5.5% |
Sample
| 1st row | 1294.2466800739585 |
|---|---|
| 2nd row | 2872.769076848754 |
| 3rd row | 3250.592564219525 |
| 4th row | 3894.154755246927 |
| 5th row | 4465.683444064726 |
| Value | Count | Frequency (%) |
| 2704.885187212414 | 232 | 14.0% |
| 1241.6133704433169 | 74 | 4.5% |
| 2872.769076848754 | 69 | 4.2% |
| 0.0 | 60 | 3.6% |
| 1292.3392160898957 | 49 | 3.0% |
| 4419.575196162919 | 48 | 2.9% |
| 1167.2226527660587 | 48 | 2.9% |
| 2874.034733991636 | 46 | 2.8% |
| 2907.040794219252 | 39 | 2.3% |
| 4191.557332376314 | 37 | 2.2% |
| Other values (217) | 958 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3286 | |
| 4 | 3194 | |
| 2 | 3081 | |
| 8 | 2849 | |
| 7 | 2844 | |
| 3 | 2330 | |
| 5 | 2277 | |
| 6 | 2169 | |
| 9 | 2146 | |
| 0 | 2026 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26202 | |
| Other Punctuation | 1660 | 6.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3286 | |
| 4 | 3194 | |
| 2 | 3081 | |
| 8 | 2849 | |
| 7 | 2844 | |
| 3 | 2330 | |
| 5 | 2277 | |
| 6 | 2169 | |
| 9 | 2146 | |
| 0 | 2026 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1660 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 27862 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3286 | |
| 4 | 3194 | |
| 2 | 3081 | |
| 8 | 2849 | |
| 7 | 2844 | |
| 3 | 2330 | |
| 5 | 2277 | |
| 6 | 2169 | |
| 9 | 2146 | |
| 0 | 2026 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27862 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3286 | |
| 4 | 3194 | |
| 2 | 3081 | |
| 8 | 2849 | |
| 7 | 2844 | |
| 3 | 2330 | |
| 5 | 2277 | |
| 6 | 2169 | |
| 9 | 2146 | |
| 0 | 2026 |
issue
Text
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 187 |
|---|---|
| Median length | 182 |
| Mean length | 102.3443991 |
| Min length | 31 |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | INSTITUTION_COLLECTION_MISMATCH |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COORDINATES;INSTITUTION_COLLECTION_MISMATCH |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;INSTITUTION_COLLECTION_MISMATCH |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COORDINATES;INSTITUTION_COLLECTION_MISMATCH |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;INSTITUTION_COLLECTION_MISMATCH |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;continent_derived_from_coordinates;institution_collection_mismatch | 109406 | |
| occurrence_status_inferred_from_individual_count;institution_collection_mismatch | 60883 | |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;institution_collection_mismatch | 37878 | 13.0% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_coordinates;taxon_match_higherrank;institution_collection_mismatch | 14042 | 4.8% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank;institution_collection_mismatch | 12982 | 4.5% |
| institution_collection_mismatch | 10900 | 3.7% |
| continent_derived_from_coordinates;institution_collection_mismatch | 8265 | 2.8% |
| continent_derived_from_country;institution_collection_mismatch | 6709 | 2.3% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;taxon_match_higherrank;institution_collection_mismatch | 5077 | 1.7% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_coordinates;taxon_match_fuzzy;institution_collection_mismatch | 3620 | 1.2% |
| Other values (68) | 21136 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 3094029 | |
| T | 2942498 | |
| N | 2807553 | |
| _ | 2589229 | 8.7% |
| O | 2467649 | 8.3% |
| C | 2378719 | 8.0% |
| E | 2118380 | 7.1% |
| R | 1991055 | 6.7% |
| U | 1407852 | 4.7% |
| D | 1340041 | 4.5% |
| Other values (15) | 6634776 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 26666823 | |
| Connector Punctuation | 2589229 | 8.7% |
| Other Punctuation | 515729 | 1.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3094029 | |
| T | 2942498 | |
| N | 2807553 | |
| O | 2467649 | |
| C | 2378719 | |
| E | 2118380 | |
| R | 1991055 | |
| U | 1407852 | 5.3% |
| D | 1340041 | 5.0% |
| S | 1254931 | 4.7% |
| Other values (13) | 4864116 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2589229 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 515729 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26666823 | |
| Common | 3104958 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 3094029 | |
| T | 2942498 | |
| N | 2807553 | |
| O | 2467649 | |
| C | 2378719 | |
| E | 2118380 | |
| R | 1991055 | |
| U | 1407852 | 5.3% |
| D | 1340041 | 5.0% |
| S | 1254931 | 4.7% |
| Other values (13) | 4864116 |
Common
| Value | Count | Frequency (%) |
| _ | 2589229 | |
| ; | 515729 | 16.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29771781 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 3094029 | |
| T | 2942498 | |
| N | 2807553 | |
| _ | 2589229 | 8.7% |
| O | 2467649 | 8.3% |
| C | 2378719 | 8.0% |
| E | 2118380 | 7.1% |
| R | 1991055 | 6.7% |
| U | 1407852 | 4.7% |
| D | 1340041 | 4.5% |
| Other values (15) | 6634776 |
mediaType
Text
Missing 
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 207500 |
| Missing (%) | 71.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 231 |
|---|---|
| Median length | 21 |
| Mean length | 21.95310439 |
| Min length | 10 |
Unique
| Unique | 43 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage;StillImage;StillImage;StillImage;StillImage;StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage;StillImage;StillImage;StillImage |
| Value | Count | Frequency (%) |
| stillimage;stillimage | 74683 | |
| stillimage;stillimage;stillimage | 3977 | 4.8% |
| stillimage | 3155 | 3.8% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 396 | 0.5% |
| stillimage;stillimage;stillimage;stillimage | 332 | 0.4% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 315 | 0.4% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 123 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 59 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 30 | < 0.1% |
| movingimage;stillimage;stillimage;stillimage | 26 | < 0.1% |
| Other values (73) | 302 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 347418 | |
| g | 174283 | |
| i | 173996 | |
| I | 173996 | |
| m | 173996 | |
| a | 173996 | |
| e | 173996 | |
| S | 173709 | |
| t | 173709 | |
| ; | 90598 | 4.9% |
| Other values (4) | 1148 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1392255 | |
| Uppercase Letter | 347992 | 19.0% |
| Other Punctuation | 90598 | 4.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 347418 | |
| g | 174283 | |
| i | 173996 | |
| m | 173996 | |
| a | 173996 | |
| e | 173996 | |
| t | 173709 | |
| o | 287 | < 0.1% |
| v | 287 | < 0.1% |
| n | 287 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 173996 | |
| S | 173709 | |
| M | 287 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 90598 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1740247 | |
| Common | 90598 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 347418 | |
| g | 174283 | |
| i | 173996 | |
| I | 173996 | |
| m | 173996 | |
| a | 173996 | |
| e | 173996 | |
| S | 173709 | |
| t | 173709 | |
| M | 287 | < 0.1% |
| Other values (3) | 861 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| ; | 90598 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1830845 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 347418 | |
| g | 174283 | |
| i | 173996 | |
| I | 173996 | |
| m | 173996 | |
| a | 173996 | |
| e | 173996 | |
| S | 173709 | |
| t | 173709 | |
| ; | 90598 | 4.9% |
| Other values (4) | 1148 | 0.1% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.478215732 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | false |
| 4th row | true |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 151786 | |
| false | 139112 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 290898 | |
| t | 151786 | |
| r | 151786 | |
| u | 151786 | |
| f | 139112 | |
| a | 139112 | |
| l | 139112 | |
| s | 139112 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1302704 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 290898 | |
| t | 151786 | |
| r | 151786 | |
| u | 151786 | |
| f | 139112 | |
| a | 139112 | |
| l | 139112 | |
| s | 139112 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1302704 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 290898 | |
| t | 151786 | |
| r | 151786 | |
| u | 151786 | |
| f | 139112 | |
| a | 139112 | |
| l | 139112 | |
| s | 139112 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1302704 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 290898 | |
| t | 151786 | |
| r | 151786 | |
| u | 151786 | |
| f | 139112 | |
| a | 139112 | |
| l | 139112 | |
| s | 139112 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.983825946 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 286193 | |
| true | 4705 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 290898 | |
| f | 286193 | |
| a | 286193 | |
| l | 286193 | |
| s | 286193 | |
| t | 4705 | 0.3% |
| r | 4705 | 0.3% |
| u | 4705 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1449785 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 290898 | |
| f | 286193 | |
| a | 286193 | |
| l | 286193 | |
| s | 286193 | |
| t | 4705 | 0.3% |
| r | 4705 | 0.3% |
| u | 4705 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1449785 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 290898 | |
| f | 286193 | |
| a | 286193 | |
| l | 286193 | |
| s | 286193 | |
| t | 4705 | 0.3% |
| r | 4705 | 0.3% |
| u | 4705 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1449785 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 290898 | |
| f | 286193 | |
| a | 286193 | |
| l | 286193 | |
| s | 286193 | |
| t | 4705 | 0.3% |
| r | 4705 | 0.3% |
| u | 4705 | 0.3% |
taxonKey
Text
| Distinct | 15605 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.990388384 |
| Min length | 1 |
Unique
| Unique | 3323 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 2484620 |
|---|---|
| 2nd row | 7342142 |
| 3rd row | 2479504 |
| 4th row | 6170652 |
| 5th row | 6170887 |
| Value | Count | Frequency (%) |
| 5231191 | 1635 | 0.6% |
| 6172874 | 1489 | 0.5% |
| 6171845 | 1145 | 0.4% |
| 2480242 | 1135 | 0.4% |
| 7191198 | 1017 | 0.3% |
| 7341902 | 981 | 0.3% |
| 9156140 | 924 | 0.3% |
| 2481137 | 897 | 0.3% |
| 7192432 | 862 | 0.3% |
| 8990910 | 856 | 0.3% |
| Other values (15595) | 279957 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 265852 | |
| 4 | 249808 | |
| 1 | 240119 | |
| 7 | 230542 | |
| 9 | 222471 | |
| 8 | 182841 | |
| 6 | 181328 | |
| 0 | 167091 | |
| 5 | 149144 | |
| 3 | 144294 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2033490 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 265852 | |
| 4 | 249808 | |
| 1 | 240119 | |
| 7 | 230542 | |
| 9 | 222471 | |
| 8 | 182841 | |
| 6 | 181328 | |
| 0 | 167091 | |
| 5 | 149144 | |
| 3 | 144294 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2033490 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 265852 | |
| 4 | 249808 | |
| 1 | 240119 | |
| 7 | 230542 | |
| 9 | 222471 | |
| 8 | 182841 | |
| 6 | 181328 | |
| 0 | 167091 | |
| 5 | 149144 | |
| 3 | 144294 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2033490 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 265852 | |
| 4 | 249808 | |
| 1 | 240119 | |
| 7 | 230542 | |
| 9 | 222471 | |
| 8 | 182841 | |
| 6 | 181328 | |
| 0 | 167091 | |
| 5 | 149144 | |
| 3 | 144294 |
acceptedTaxonKey
Text
| Distinct | 14746 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.988418581 |
| Min length | 1 |
Unique
| Unique | 3066 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 2484620 |
|---|---|
| 2nd row | 7342142 |
| 3rd row | 2479504 |
| 4th row | 6170652 |
| 5th row | 6170887 |
| Value | Count | Frequency (%) |
| 5231191 | 1635 | 0.6% |
| 6172874 | 1489 | 0.5% |
| 6065824 | 1204 | 0.4% |
| 6171845 | 1145 | 0.4% |
| 2480242 | 1135 | 0.4% |
| 7191198 | 1017 | 0.3% |
| 7341902 | 981 | 0.3% |
| 9156140 | 924 | 0.3% |
| 7192432 | 869 | 0.3% |
| 8990910 | 856 | 0.3% |
| Other values (14736) | 279642 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2032910 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2032910 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2032910 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 262848 | |
| 4 | 256924 | |
| 1 | 235914 | |
| 7 | 227945 | |
| 9 | 218745 | |
| 8 | 186874 | |
| 6 | 180095 | |
| 0 | 173457 | |
| 5 | 148365 | |
| 3 | 141743 |
kingdomKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 290897 | |
| 0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 290897 | |
| 0 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 290898 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 290897 | |
| 0 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 290898 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 290897 | |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 290898 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 290897 | |
| 0 | 1 | < 0.1% |
phylumKey
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 766 |
| Missing (%) | 0.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 290063 | |
| 54 | 67 | < 0.1% |
| 52 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 580193 | |
| 5 | 69 | < 0.1% |
| 2 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 580264 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 580193 | |
| 5 | 69 | < 0.1% |
| 2 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 580264 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 580193 | |
| 5 | 69 | < 0.1% |
| 2 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 580264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 580193 | |
| 5 | 69 | < 0.1% |
| 2 | 2 | < 0.1% |
classKey
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 770 |
| Missing (%) | 0.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.000017234 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 212 |
|---|---|
| 2nd row | 212 |
| 3rd row | 212 |
| 4th row | 212 |
| 5th row | 212 |
| Value | Count | Frequency (%) |
| 212 | 290053 | |
| 216 | 66 | < 0.1% |
| 359 | 6 | < 0.1% |
| 137 | 1 | < 0.1% |
| 11592253 | 1 | < 0.1% |
| 229 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 580176 | |
| 1 | 290122 | |
| 6 | 66 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 8 | < 0.1% |
| 9 | 8 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 870389 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 580176 | |
| 1 | 290122 | |
| 6 | 66 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 8 | < 0.1% |
| 9 | 8 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 870389 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 580176 | |
| 1 | 290122 | |
| 6 | 66 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 8 | < 0.1% |
| 9 | 8 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 870389 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 580176 | |
| 1 | 290122 | |
| 6 | 66 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 8 | < 0.1% |
| 9 | 8 | < 0.1% |
| 7 | 1 | < 0.1% |
orderKey
Text
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1492 |
| Missing (%) | 0.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 4.111473155 |
| Min length | 3 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 729 |
|---|---|
| 2nd row | 729 |
| 3rd row | 1445 |
| 4th row | 1445 |
| 5th row | 1445 |
| Value | Count | Frequency (%) |
| 729 | 145670 | |
| 7192402 | 33385 | 11.5% |
| 7191147 | 10340 | 3.6% |
| 1108 | 10163 | 3.5% |
| 1446 | 9902 | 3.4% |
| 724 | 8355 | 2.9% |
| 723 | 7462 | 2.6% |
| 1448 | 7409 | 2.6% |
| 7190953 | 7141 | 2.5% |
| 1447 | 7019 | 2.4% |
| Other values (43) | 42560 | 14.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 256129 | |
| 2 | 240651 | |
| 9 | 230658 | |
| 1 | 163879 | |
| 4 | 141291 | |
| 0 | 63100 | 5.3% |
| 5 | 32060 | 2.7% |
| 8 | 26025 | 2.2% |
| 3 | 22156 | 1.9% |
| 6 | 13936 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1189885 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 256129 | |
| 2 | 240651 | |
| 9 | 230658 | |
| 1 | 163879 | |
| 4 | 141291 | |
| 0 | 63100 | 5.3% |
| 5 | 32060 | 2.7% |
| 8 | 26025 | 2.2% |
| 3 | 22156 | 1.9% |
| 6 | 13936 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1189885 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 256129 | |
| 2 | 240651 | |
| 9 | 230658 | |
| 1 | 163879 | |
| 4 | 141291 | |
| 0 | 63100 | 5.3% |
| 5 | 32060 | 2.7% |
| 8 | 26025 | 2.2% |
| 3 | 22156 | 1.9% |
| 6 | 13936 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1189885 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 256129 | |
| 2 | 240651 | |
| 9 | 230658 | |
| 1 | 163879 | |
| 4 | 141291 | |
| 0 | 63100 | 5.3% |
| 5 | 32060 | 2.7% |
| 8 | 26025 | 2.2% |
| 3 | 22156 | 1.9% |
| 6 | 13936 | 1.2% |
familyKey
Text
| Distinct | 250 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1542 |
| Missing (%) | 0.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.279527641 |
| Min length | 4 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 5295 |
|---|---|
| 2nd row | 5290 |
| 3rd row | 9340 |
| 4th row | 9340 |
| 5th row | 9340 |
| Value | Count | Frequency (%) |
| 5282 | 11999 | 4.1% |
| 9322 | 11304 | 3.9% |
| 2986 | 10104 | 3.5% |
| 2877 | 9920 | 3.4% |
| 5233 | 9902 | 3.4% |
| 9316 | 9646 | 3.3% |
| 5242 | 8978 | 3.1% |
| 5235 | 7653 | 2.6% |
| 5290 | 6987 | 2.4% |
| 9340 | 6737 | 2.3% |
| Other values (240) | 196126 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 236924 | |
| 9 | 187320 | |
| 3 | 175452 | |
| 5 | 170061 | |
| 8 | 93668 | 7.6% |
| 0 | 86294 | 7.0% |
| 1 | 78151 | 6.3% |
| 7 | 70985 | 5.7% |
| 4 | 70809 | 5.7% |
| 6 | 68643 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1238307 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 236924 | |
| 9 | 187320 | |
| 3 | 175452 | |
| 5 | 170061 | |
| 8 | 93668 | 7.6% |
| 0 | 86294 | 7.0% |
| 1 | 78151 | 6.3% |
| 7 | 70985 | 5.7% |
| 4 | 70809 | 5.7% |
| 6 | 68643 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1238307 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 236924 | |
| 9 | 187320 | |
| 3 | 175452 | |
| 5 | 170061 | |
| 8 | 93668 | 7.6% |
| 0 | 86294 | 7.0% |
| 1 | 78151 | 6.3% |
| 7 | 70985 | 5.7% |
| 4 | 70809 | 5.7% |
| 6 | 68643 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1238307 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 236924 | |
| 9 | 187320 | |
| 3 | 175452 | |
| 5 | 170061 | |
| 8 | 93668 | 7.6% |
| 0 | 86294 | 7.0% |
| 1 | 78151 | 6.3% |
| 7 | 70985 | 5.7% |
| 4 | 70809 | 5.7% |
| 6 | 68643 | 5.5% |
genusKey
Text
| Distinct | 2200 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1404 |
| Missing (%) | 0.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.002994881 |
| Min length | 7 |
Unique
| Unique | 162 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2484612 |
|---|---|
| 2nd row | 2490714 |
| 3rd row | 2479503 |
| 4th row | 9623552 |
| 5th row | 2479496 |
| Value | Count | Frequency (%) |
| 2490714 | 5647 | 2.0% |
| 2481739 | 3893 | 1.3% |
| 2480996 | 3593 | 1.2% |
| 2486114 | 3364 | 1.2% |
| 2492321 | 3110 | 1.1% |
| 9405810 | 2980 | 1.0% |
| 2492941 | 2742 | 0.9% |
| 2491468 | 2716 | 0.9% |
| 2481126 | 2634 | 0.9% |
| 2482468 | 2475 | 0.9% |
| Other values (2190) | 256340 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 406143 | |
| 2 | 385130 | |
| 9 | 236025 | |
| 8 | 233741 | |
| 7 | 147304 | 7.3% |
| 1 | 135409 | 6.7% |
| 0 | 129989 | 6.4% |
| 3 | 125202 | 6.2% |
| 6 | 121813 | 6.0% |
| 5 | 106569 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2027325 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 406143 | |
| 2 | 385130 | |
| 9 | 236025 | |
| 8 | 233741 | |
| 7 | 147304 | 7.3% |
| 1 | 135409 | 6.7% |
| 0 | 129989 | 6.4% |
| 3 | 125202 | 6.2% |
| 6 | 121813 | 6.0% |
| 5 | 106569 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2027325 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 406143 | |
| 2 | 385130 | |
| 9 | 236025 | |
| 8 | 233741 | |
| 7 | 147304 | 7.3% |
| 1 | 135409 | 6.7% |
| 0 | 129989 | 6.4% |
| 3 | 125202 | 6.2% |
| 6 | 121813 | 6.0% |
| 5 | 106569 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2027325 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 406143 | |
| 2 | 385130 | |
| 9 | 236025 | |
| 8 | 233741 | |
| 7 | 147304 | 7.3% |
| 1 | 135409 | 6.7% |
| 0 | 129989 | 6.4% |
| 3 | 125202 | 6.2% |
| 6 | 121813 | 6.0% |
| 5 | 106569 | 5.3% |
speciesKey
Text
Missing 
| Distinct | 7224 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 10568 |
| Missing (%) | 3.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.006963222 |
| Min length | 7 |
Unique
| Unique | 863 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 2484620 |
|---|---|
| 2nd row | 2490774 |
| 3rd row | 2479504 |
| 4th row | 2479311 |
| 5th row | 2479498 |
| Value | Count | Frequency (%) |
| 5231190 | 1816 | 0.6% |
| 9809229 | 1734 | 0.6% |
| 5229493 | 1415 | 0.5% |
| 2490719 | 1395 | 0.5% |
| 2494422 | 1359 | 0.5% |
| 7901064 | 1357 | 0.5% |
| 9616058 | 1340 | 0.5% |
| 9705453 | 1219 | 0.4% |
| 6065824 | 1204 | 0.4% |
| 2480637 | 1195 | 0.4% |
| Other values (7214) | 266296 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 362903 | |
| 4 | 315405 | |
| 9 | 214561 | |
| 8 | 207833 | |
| 7 | 153593 | |
| 5 | 151542 | |
| 0 | 150344 | |
| 1 | 146530 | |
| 3 | 135043 | 6.9% |
| 6 | 126508 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1964262 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 362903 | |
| 4 | 315405 | |
| 9 | 214561 | |
| 8 | 207833 | |
| 7 | 153593 | |
| 5 | 151542 | |
| 0 | 150344 | |
| 1 | 146530 | |
| 3 | 135043 | 6.9% |
| 6 | 126508 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1964262 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 362903 | |
| 4 | 315405 | |
| 9 | 214561 | |
| 8 | 207833 | |
| 7 | 153593 | |
| 5 | 151542 | |
| 0 | 150344 | |
| 1 | 146530 | |
| 3 | 135043 | 6.9% |
| 6 | 126508 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1964262 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 362903 | |
| 4 | 315405 | |
| 9 | 214561 | |
| 8 | 207833 | |
| 7 | 153593 | |
| 5 | 151542 | |
| 0 | 150344 | |
| 1 | 146530 | |
| 3 | 135043 | 6.9% |
| 6 | 126508 | 6.4% |
species
Text
Missing 
| Distinct | 7216 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 10568 |
| Missing (%) | 3.6% |
| Memory size | 2.2 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 28 |
| Mean length | 17.82155317 |
| Min length | 9 |
Unique
| Unique | 862 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Vidua orientalis |
|---|---|
| 2nd row | Turdus viscivorus |
| 3rd row | Neophema splendida |
| 4th row | Platycercus elegans |
| 5th row | Polytelis anthopeplus |
| Value | Count | Frequency (%) |
| turdus | 5630 | 1.0% |
| calidris | 3874 | 0.7% |
| falco | 3588 | 0.6% |
| passer | 3103 | 0.6% |
| pycnonotus | 3090 | 0.6% |
| accipiter | 2976 | 0.5% |
| sylvia | 2721 | 0.5% |
| emberiza | 2715 | 0.5% |
| larus | 2625 | 0.5% |
| buteo | 2500 | 0.4% |
| Other values (6121) | 528306 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 529984 | 10.6% |
| s | 420166 | 8.4% |
| i | 417749 | 8.4% |
| r | 355972 | 7.1% |
| u | 347702 | 7.0% |
| o | 311426 | 6.2% |
| e | 289384 | 5.8% |
| l | 285661 | 5.7% |
| 280798 | 5.6% | |
| n | 251540 | 5.0% |
| Other values (43) | 1505534 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4434788 | |
| Space Separator | 280798 | 5.6% |
| Uppercase Letter | 280330 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 529984 | |
| s | 420166 | |
| i | 417749 | |
| r | 355972 | 8.0% |
| u | 347702 | 7.8% |
| o | 311426 | 7.0% |
| e | 289384 | 6.5% |
| l | 285661 | 6.4% |
| n | 251540 | 5.7% |
| c | 250267 | 5.6% |
| Other values (16) | 974937 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 45208 | |
| C | 44095 | |
| A | 32371 | |
| T | 22022 | |
| S | 21712 | |
| M | 17089 | 6.1% |
| L | 16790 | 6.0% |
| G | 11340 | 4.0% |
| E | 9976 | 3.6% |
| F | 9059 | 3.2% |
| Other values (16) | 50668 |
Space Separator
| Value | Count | Frequency (%) |
| 280798 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4715118 | |
| Common | 280798 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 529984 | |
| s | 420166 | 8.9% |
| i | 417749 | 8.9% |
| r | 355972 | 7.5% |
| u | 347702 | 7.4% |
| o | 311426 | 6.6% |
| e | 289384 | 6.1% |
| l | 285661 | 6.1% |
| n | 251540 | 5.3% |
| c | 250267 | 5.3% |
| Other values (42) | 1255267 |
Common
| Value | Count | Frequency (%) |
| 280798 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4995916 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 529984 | 10.6% |
| s | 420166 | 8.4% |
| i | 417749 | 8.4% |
| r | 355972 | 7.1% |
| u | 347702 | 7.0% |
| o | 311426 | 6.2% |
| e | 289384 | 5.8% |
| l | 285661 | 5.7% |
| 280798 | 5.6% | |
| n | 251540 | 5.0% |
| Other values (43) | 1505534 |
| Distinct | 14746 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 75 |
|---|---|
| Median length | 64 |
| Mean length | 33.57232629 |
| Min length | 4 |
Unique
| Unique | 3066 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | Vidua orientalis Heuglin, 1870 |
|---|---|
| 2nd row | Turdus viscivorus viscivorus |
| 3rd row | Neophema splendida (Gould, 1841) |
| 4th row | Platycercus elegans melanopterus North, 1906 |
| 5th row | Polytelis anthopeplus monarchoides Schodde, 1993 |
| Value | Count | Frequency (%) |
| linnaeus | 46619 | 4.1% |
| 1758 | 36632 | 3.2% |
| temminck | 9316 | 0.8% |
| 1766 | 8978 | 0.8% |
| vieillot | 8962 | 0.8% |
| 8270 | 0.7% | |
| 1789 | 7244 | 0.6% |
| 1821 | 6618 | 0.6% |
| horsfield | 6307 | 0.6% |
| gmelin | 5819 | 0.5% |
| Other values (9799) | 988193 |
Most occurring characters
| Value | Count | Frequency (%) |
| 842061 | 8.6% | |
| a | 828778 | 8.5% |
| i | 702778 | 7.2% |
| s | 656289 | 6.7% |
| e | 559707 | 5.7% |
| r | 534908 | 5.5% |
| u | 520424 | 5.3% |
| n | 514076 | 5.3% |
| l | 461054 | 4.7% |
| o | 460789 | 4.7% |
| Other values (72) | 3685225 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7139395 | |
| Space Separator | 842061 | 8.6% |
| Decimal Number | 765010 | 7.8% |
| Uppercase Letter | 537011 | 5.5% |
| Other Punctuation | 243588 | 2.5% |
| Close Punctuation | 119038 | 1.2% |
| Open Punctuation | 119038 | 1.2% |
| Dash Punctuation | 947 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 828778 | |
| i | 702778 | |
| s | 656289 | |
| e | 559707 | 7.8% |
| r | 534908 | 7.5% |
| u | 520424 | 7.3% |
| n | 514076 | 7.2% |
| l | 461054 | 6.5% |
| o | 460789 | 6.5% |
| c | 356956 | 5.0% |
| Other values (25) | 1543636 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 81449 | |
| P | 55576 | |
| C | 54000 | |
| S | 46590 | |
| A | 36902 | 6.9% |
| G | 34502 | 6.4% |
| T | 34045 | 6.3% |
| B | 27485 | 5.1% |
| M | 26544 | 4.9% |
| H | 24693 | 4.6% |
| Other values (17) | 115225 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 230603 | |
| 8 | 167122 | |
| 7 | 102767 | |
| 5 | 56503 | 7.4% |
| 6 | 44366 | 5.8% |
| 9 | 43486 | 5.7% |
| 2 | 37523 | 4.9% |
| 3 | 33722 | 4.4% |
| 0 | 26573 | 3.5% |
| 4 | 22345 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 191266 | |
| . | 43873 | 18.0% |
| & | 8267 | 3.4% |
| ' | 179 | 0.1% |
| ? | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 842061 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 119038 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 119038 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 947 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7676406 | |
| Common | 2089683 | 21.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 828778 | 10.8% |
| i | 702778 | 9.2% |
| s | 656289 | 8.5% |
| e | 559707 | 7.3% |
| r | 534908 | 7.0% |
| u | 520424 | 6.8% |
| n | 514076 | 6.7% |
| l | 461054 | 6.0% |
| o | 460789 | 6.0% |
| c | 356956 | 4.7% |
| Other values (52) | 2080647 |
Common
| Value | Count | Frequency (%) |
| 842061 | ||
| 1 | 230603 | 11.0% |
| , | 191266 | 9.2% |
| 8 | 167122 | 8.0% |
| ) | 119038 | 5.7% |
| ( | 119038 | 5.7% |
| 7 | 102767 | 4.9% |
| 5 | 56503 | 2.7% |
| 6 | 44366 | 2.1% |
| . | 43873 | 2.1% |
| Other values (10) | 173046 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9762382 | |
| None | 3707 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 842061 | 8.6% | |
| a | 828778 | 8.5% |
| i | 702778 | 7.2% |
| s | 656289 | 6.7% |
| e | 559707 | 5.7% |
| r | 534908 | 5.5% |
| u | 520424 | 5.3% |
| n | 514076 | 5.3% |
| l | 461054 | 4.7% |
| o | 460789 | 4.7% |
| Other values (61) | 3681518 |
None
| Value | Count | Frequency (%) |
| ü | 2457 | |
| ø | 471 | 12.7% |
| é | 262 | 7.1% |
| ä | 242 | 6.5% |
| á | 177 | 4.8% |
| É | 42 | 1.1% |
| è | 40 | 1.1% |
| ë | 5 | 0.1% |
| ñ | 5 | 0.1% |
| ö | 5 | 0.1% |
| Distinct | 27734 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 73 |
| Mean length | 38.15606845 |
| Min length | 3 |
Unique
| Unique | 8751 ? |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | Vidua orientalis cf Heuglin, 1871 |
|---|---|
| 2nd row | Turdus viscivorus viscivorus Linnaeus, 1758 |
| 3rd row | Neophema splendida Gould, 1841 |
| 4th row | Platycercus elegans melanopterus North, 1906 |
| 5th row | Polytelis anthopeplus monarchoides |
| Value | Count | Frequency (%) |
| linnaeus | 87601 | 6.6% |
| 1758 | 63048 | 4.8% |
| temminck | 13095 | 1.0% |
| vieillot | 10951 | 0.8% |
| 10616 | 0.8% | |
| gmelin | 9488 | 0.7% |
| horsfield | 8374 | 0.6% |
| 1766 | 7987 | 0.6% |
| 1789 | 5944 | 0.5% |
| 1821 | 5917 | 0.4% |
| Other values (11808) | 1095813 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1027983 | 9.3% | |
| a | 957166 | 8.6% |
| i | 794136 | 7.2% |
| s | 749765 | 6.8% |
| e | 662963 | 6.0% |
| n | 638053 | 5.7% |
| r | 590663 | 5.3% |
| u | 588433 | 5.3% |
| l | 507153 | 4.6% |
| o | 490037 | 4.4% |
| Other values (89) | 4093172 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8068871 | |
| Space Separator | 1027983 | 9.3% |
| Decimal Number | 872433 | 7.9% |
| Uppercase Letter | 606433 | 5.5% |
| Other Punctuation | 282499 | 2.5% |
| Open Punctuation | 120086 | 1.1% |
| Close Punctuation | 120028 | 1.1% |
| Dash Punctuation | 908 | < 0.1% |
| Math Symbol | 282 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 957166 | |
| i | 794136 | |
| s | 749765 | |
| e | 662963 | 8.2% |
| n | 638053 | 7.9% |
| r | 590663 | 7.3% |
| u | 588433 | 7.3% |
| l | 507153 | 6.3% |
| o | 490037 | 6.1% |
| t | 388478 | 4.8% |
| Other values (30) | 1702024 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 123494 | |
| P | 57198 | |
| C | 50372 | |
| S | 50352 | |
| T | 36890 | 6.1% |
| A | 36500 | 6.0% |
| G | 35206 | 5.8% |
| B | 31451 | 5.2% |
| M | 30799 | 5.1% |
| H | 30507 | 5.0% |
| Other values (16) | 123664 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 229223 | |
| . | 42279 | 15.0% |
| & | 9861 | 3.5% |
| ' | 558 | 0.2% |
| ? | 301 | 0.1% |
| " | 142 | 0.1% |
| / | 69 | < 0.1% |
| : | 43 | < 0.1% |
| \ | 16 | < 0.1% |
| ! | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 255589 | |
| 8 | 193807 | |
| 7 | 127628 | |
| 5 | 83250 | 9.5% |
| 9 | 44056 | 5.0% |
| 6 | 43070 | 4.9% |
| 2 | 39450 | 4.5% |
| 3 | 33961 | 3.9% |
| 4 | 26105 | 3.0% |
| 0 | 25517 | 2.9% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 140 | |
| > | 131 | |
| = | 9 | 3.2% |
| ∩ | 2 | 0.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 119957 | |
| ] | 42 | < 0.1% |
| } | 29 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 120015 | |
| [ | 71 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1027983 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 908 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8675294 | |
| Common | 2424220 | 21.8% |
| Greek | 10 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 957166 | |
| i | 794136 | 9.2% |
| s | 749765 | 8.6% |
| e | 662963 | 7.6% |
| n | 638053 | 7.4% |
| r | 590663 | 6.8% |
| u | 588433 | 6.8% |
| l | 507153 | 5.8% |
| o | 490037 | 5.6% |
| t | 388478 | 4.5% |
| Other values (55) | 2308447 |
Common
| Value | Count | Frequency (%) |
| 1027983 | ||
| 1 | 255589 | 10.5% |
| , | 229223 | 9.5% |
| 8 | 193807 | 8.0% |
| 7 | 127628 | 5.3% |
| ( | 120015 | 5.0% |
| ) | 119957 | 4.9% |
| 5 | 83250 | 3.4% |
| 9 | 44056 | 1.8% |
| 6 | 43070 | 1.8% |
| Other values (23) | 179642 | 7.4% |
Greek
| Value | Count | Frequency (%) |
| δ | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11090283 | |
| None | 9239 | 0.1% |
| Math Operators | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1027983 | 9.3% | |
| a | 957166 | 8.6% |
| i | 794136 | 7.2% |
| s | 749765 | 6.8% |
| e | 662963 | 6.0% |
| n | 638053 | 5.8% |
| r | 590663 | 5.3% |
| u | 588433 | 5.3% |
| l | 507153 | 4.6% |
| o | 490037 | 4.4% |
| Other values (74) | 4083931 |
None
| Value | Count | Frequency (%) |
| ü | 7442 | |
| é | 473 | 5.1% |
| ø | 466 | 5.0% |
| ä | 384 | 4.2% |
| á | 246 | 2.7% |
| ö | 58 | 0.6% |
| ï | 55 | 0.6% |
| ë | 52 | 0.6% |
| è | 46 | 0.5% |
| δ | 10 | 0.1% |
| Other values (4) | 7 | 0.1% |
Math Operators
| Value | Count | Frequency (%) |
| ∩ | 2 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DWC_ARCHIVE |
|---|---|
| 2nd row | DWC_ARCHIVE |
| 3rd row | DWC_ARCHIVE |
| 4th row | DWC_ARCHIVE |
| 5th row | DWC_ARCHIVE |
| Value | Count | Frequency (%) |
| dwc_archive | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 581796 | |
| D | 290898 | |
| W | 290898 | |
| _ | 290898 | |
| A | 290898 | |
| R | 290898 | |
| H | 290898 | |
| I | 290898 | |
| V | 290898 | |
| E | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2908980 | |
| Connector Punctuation | 290898 | 9.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 581796 | |
| D | 290898 | |
| W | 290898 | |
| A | 290898 | |
| R | 290898 | |
| H | 290898 | |
| I | 290898 | |
| V | 290898 | |
| E | 290898 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2908980 | |
| Common | 290898 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 581796 | |
| D | 290898 | |
| W | 290898 | |
| A | 290898 | |
| R | 290898 | |
| H | 290898 | |
| I | 290898 | |
| V | 290898 | |
| E | 290898 |
Common
| Value | Count | Frequency (%) |
| _ | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3199878 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 581796 | |
| D | 290898 | |
| W | 290898 | |
| _ | 290898 | |
| A | 290898 | |
| R | 290898 | |
| H | 290898 | |
| I | 290898 | |
| V | 290898 | |
| E | 290898 |
lastParsed
Text
| Distinct | 24995 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99487105 |
| Min length | 20 |
Unique
| Unique | 2133 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 2025-01-03T11:41:38.952Z |
|---|---|
| 2nd row | 2025-01-03T11:41:39.036Z |
| 3rd row | 2025-01-03T11:41:41.369Z |
| 4th row | 2025-01-03T11:41:41.370Z |
| 5th row | 2025-01-03T11:41:41.379Z |
| Value | Count | Frequency (%) |
| 2025-01-03t11:42:05.126z | 149 | 0.1% |
| 2025-01-03t11:42:05.124z | 149 | 0.1% |
| 2025-01-03t11:42:05.005z | 148 | 0.1% |
| 2025-01-03t11:42:05.125z | 146 | 0.1% |
| 2025-01-03t11:42:05.127z | 145 | < 0.1% |
| 2025-01-03t11:42:05.122z | 145 | < 0.1% |
| 2025-01-03t11:42:05.010z | 142 | < 0.1% |
| 2025-01-03t11:42:04.999z | 141 | < 0.1% |
| 2025-01-03t11:42:05.042z | 139 | < 0.1% |
| 2025-01-03t11:42:04.998z | 138 | < 0.1% |
| Other values (24985) | 289456 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| - | 581796 | |
| : | 581796 | |
| 5 | 506791 | |
| 4 | 439261 | 6.3% |
| 3 | 387300 | 5.5% |
| T | 290898 | 4.2% |
| Z | 290898 | 4.2% |
| Other values (5) | 748367 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4944147 | |
| Other Punctuation | 872321 | 12.5% |
| Dash Punctuation | 581796 | 8.3% |
| Uppercase Letter | 581796 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| 5 | 506791 | |
| 4 | 439261 | 8.9% |
| 3 | 387300 | 7.8% |
| 8 | 122218 | 2.5% |
| 6 | 117907 | 2.4% |
| 9 | 114950 | 2.3% |
| 7 | 102767 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 581796 | |
| . | 290525 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6398264 | |
| Latin | 581796 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| - | 581796 | |
| : | 581796 | |
| 5 | 506791 | |
| 4 | 439261 | 6.9% |
| 3 | 387300 | 6.1% |
| . | 290525 | 4.5% |
| 8 | 122218 | 1.9% |
| Other values (3) | 335624 | 5.2% |
Latin
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6980060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1161663 | |
| 0 | 1129747 | |
| 2 | 861543 | |
| - | 581796 | |
| : | 581796 | |
| 5 | 506791 | |
| 4 | 439261 | 6.3% |
| 3 | 387300 | 5.5% |
| T | 290898 | 4.2% |
| Z | 290898 | 4.2% |
| Other values (5) | 748367 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2025-01-03T11:34:30.428Z |
|---|---|
| 2nd row | 2025-01-03T11:34:30.428Z |
| 3rd row | 2025-01-03T11:34:30.428Z |
| 4th row | 2025-01-03T11:34:30.428Z |
| 5th row | 2025-01-03T11:34:30.428Z |
| Value | Count | Frequency (%) |
| 2025-01-03t11:34:30.428z | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1163592 | |
| 2 | 872694 | |
| 1 | 872694 | |
| 3 | 872694 | |
| - | 581796 | |
| : | 581796 | |
| 4 | 581796 | |
| 5 | 290898 | 4.2% |
| T | 290898 | 4.2% |
| . | 290898 | 4.2% |
| Other values (2) | 581796 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4945266 | |
| Other Punctuation | 872694 | 12.5% |
| Dash Punctuation | 581796 | 8.3% |
| Uppercase Letter | 581796 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1163592 | |
| 2 | 872694 | |
| 1 | 872694 | |
| 3 | 872694 | |
| 4 | 581796 | |
| 5 | 290898 | 5.9% |
| 8 | 290898 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 581796 | |
| . | 290898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 581796 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6399756 | |
| Latin | 581796 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1163592 | |
| 2 | 872694 | |
| 1 | 872694 | |
| 3 | 872694 | |
| - | 581796 | |
| : | 581796 | |
| 4 | 581796 | |
| 5 | 290898 | 4.5% |
| . | 290898 | 4.5% |
| 8 | 290898 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 290898 | |
| Z | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6981552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1163592 | |
| 2 | 872694 | |
| 1 | 872694 | |
| 3 | 872694 | |
| - | 581796 | |
| : | 581796 | |
| 4 | 581796 | |
| 5 | 290898 | 4.2% |
| T | 290898 | 4.2% |
| . | 290898 | 4.2% |
| Other values (2) | 581796 |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 46939 |
| Missing (%) | 16.1% |
| Memory size | 2.2 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.284781459 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | true |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 174484 | |
| false | 69475 | 28.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 243959 | |
| t | 174484 | |
| r | 174484 | |
| u | 174484 | |
| f | 69475 | 6.6% |
| a | 69475 | 6.6% |
| l | 69475 | 6.6% |
| s | 69475 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1045311 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 243959 | |
| t | 174484 | |
| r | 174484 | |
| u | 174484 | |
| f | 69475 | 6.6% |
| a | 69475 | 6.6% |
| l | 69475 | 6.6% |
| s | 69475 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1045311 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 243959 | |
| t | 174484 | |
| r | 174484 | |
| u | 174484 | |
| f | 69475 | 6.6% |
| a | 69475 | 6.6% |
| l | 69475 | 6.6% |
| s | 69475 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1045311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 243959 | |
| t | 174484 | |
| r | 174484 | |
| u | 174484 | |
| f | 69475 | 6.6% |
| a | 69475 | 6.6% |
| l | 69475 | 6.6% |
| s | 69475 | 6.6% |
isSequenced
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 290898 | |
| a | 290898 | |
| l | 290898 | |
| s | 290898 | |
| e | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1454490 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 290898 | |
| a | 290898 | |
| l | 290898 | |
| s | 290898 | |
| e | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1454490 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 290898 | |
| a | 290898 | |
| l | 290898 | |
| s | 290898 | |
| e | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1454490 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 290898 | |
| a | 290898 | |
| l | 290898 | |
| s | 290898 | |
| e | 290898 |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 50475 |
| Missing (%) | 17.4% |
| Memory size | 2.2 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 6.326869725 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | OCEANIA |
| 3rd row | OCEANIA |
| 4th row | OCEANIA |
| 5th row | AFRICA |
| Value | Count | Frequency (%) |
| asia | 91619 | |
| europe | 88150 | |
| latin_america | 31593 | 13.1% |
| africa | 19035 | 7.9% |
| north_america | 4960 | 2.1% |
| oceania | 4770 | 2.0% |
| antarctica | 296 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 336435 | |
| E | 217623 | |
| I | 183866 | |
| R | 148994 | |
| O | 97880 | 6.4% |
| S | 91619 | 6.0% |
| U | 88150 | 5.8% |
| P | 88150 | 5.8% |
| C | 60950 | 4.0% |
| N | 41619 | 2.7% |
| Other values (6) | 165839 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1484572 | |
| Connector Punctuation | 36553 | 2.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 336435 | |
| E | 217623 | |
| I | 183866 | |
| R | 148994 | |
| O | 97880 | 6.6% |
| S | 91619 | 6.2% |
| U | 88150 | 5.9% |
| P | 88150 | 5.9% |
| C | 60950 | 4.1% |
| N | 41619 | 2.8% |
| Other values (5) | 129286 | 8.7% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 36553 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1484572 | |
| Common | 36553 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 336435 | |
| E | 217623 | |
| I | 183866 | |
| R | 148994 | |
| O | 97880 | 6.6% |
| S | 91619 | 6.2% |
| U | 88150 | 5.9% |
| P | 88150 | 5.9% |
| C | 60950 | 4.1% |
| N | 41619 | 2.8% |
| Other values (5) | 129286 | 8.7% |
Common
| Value | Count | Frequency (%) |
| _ | 36553 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1521125 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 336435 | |
| E | 217623 | |
| I | 183866 | |
| R | 148994 | |
| O | 97880 | 6.4% |
| S | 91619 | 6.0% |
| U | 88150 | 5.8% |
| P | 88150 | 5.8% |
| C | 60950 | 4.0% |
| N | 41619 | 2.7% |
| Other values (6) | 165839 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | EUROPE |
| 3rd row | EUROPE |
| 4th row | EUROPE |
| 5th row | EUROPE |
| Value | Count | Frequency (%) |
| europe | 290898 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 581796 | |
| U | 290898 | |
| R | 290898 | |
| O | 290898 | |
| P | 290898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1745388 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 581796 | |
| U | 290898 | |
| R | 290898 | |
| O | 290898 | |
| P | 290898 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1745388 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 581796 | |
| U | 290898 | |
| R | 290898 | |
| O | 290898 | |
| P | 290898 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1745388 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 581796 | |
| U | 290898 | |
| R | 290898 | |
| O | 290898 | |
| P | 290898 |
level0Gid
Text
Missing 
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 158562 |
| Missing (%) | 54.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NLD |
|---|---|
| 2nd row | AUS |
| 3rd row | GMB |
| 4th row | NZL |
| 5th row | MDG |
| Value | Count | Frequency (%) |
| nld | 46099 | |
| idn | 43479 | |
| sur | 3814 | 2.9% |
| usa | 1760 | 1.3% |
| gbr | 1501 | 1.1% |
| rus | 1424 | 1.1% |
| deu | 1330 | 1.0% |
| chn | 1268 | 1.0% |
| bra | 1179 | 0.9% |
| twn | 1083 | 0.8% |
| Other values (207) | 29399 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 97554 | |
| D | 94303 | |
| L | 52176 | |
| I | 46877 | |
| R | 12813 | 3.2% |
| U | 12497 | 3.1% |
| A | 12065 | 3.0% |
| S | 12061 | 3.0% |
| E | 7425 | 1.9% |
| G | 5996 | 1.5% |
| Other values (22) | 43241 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 396526 | |
| Decimal Number | 482 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97554 | |
| D | 94303 | |
| L | 52176 | |
| I | 46877 | |
| R | 12813 | 3.2% |
| U | 12497 | 3.2% |
| A | 12065 | 3.0% |
| S | 12061 | 3.0% |
| E | 7425 | 1.9% |
| G | 5996 | 1.5% |
| Other values (16) | 42759 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 241 | |
| 1 | 180 | |
| 3 | 30 | 6.2% |
| 6 | 29 | 6.0% |
| 2 | 1 | 0.2% |
| 7 | 1 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 396526 | |
| Common | 482 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 97554 | |
| D | 94303 | |
| L | 52176 | |
| I | 46877 | |
| R | 12813 | 3.2% |
| U | 12497 | 3.2% |
| A | 12065 | 3.0% |
| S | 12061 | 3.0% |
| E | 7425 | 1.9% |
| G | 5996 | 1.5% |
| Other values (16) | 42759 |
Common
| Value | Count | Frequency (%) |
| 0 | 241 | |
| 1 | 180 | |
| 3 | 30 | 6.2% |
| 6 | 29 | 6.0% |
| 2 | 1 | 0.2% |
| 7 | 1 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 397008 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 97554 | |
| D | 94303 | |
| L | 52176 | |
| I | 46877 | |
| R | 12813 | 3.2% |
| U | 12497 | 3.1% |
| A | 12065 | 3.0% |
| S | 12061 | 3.0% |
| E | 7425 | 1.9% |
| G | 5996 | 1.5% |
| Other values (22) | 43241 |
level0Name
Text
Missing 
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 158562 |
| Missing (%) | 54.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 9.584957986 |
| Min length | 4 |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Netherlands |
|---|---|
| 2nd row | Australia |
| 3rd row | Gambia |
| 4th row | New Zealand |
| 5th row | Madagascar |
| Value | Count | Frequency (%) |
| netherlands | 46099 | |
| indonesia | 43479 | |
| suriname | 3814 | 2.6% |
| united | 3266 | 2.3% |
| states | 1764 | 1.2% |
| kingdom | 1501 | 1.0% |
| russia | 1424 | 1.0% |
| germany | 1330 | 0.9% |
| china | 1268 | 0.9% |
| and | 1222 | 0.8% |
| Other values (258) | 39462 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 166311 | |
| e | 163207 | |
| a | 144224 | |
| d | 103145 | 8.1% |
| s | 101363 | 8.0% |
| i | 80825 | 6.4% |
| r | 65408 | 5.2% |
| t | 61310 | 4.8% |
| l | 57995 | 4.6% |
| o | 54929 | 4.3% |
| Other values (53) | 269718 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1112019 | |
| Uppercase Letter | 143274 | 11.3% |
| Space Separator | 12293 | 1.0% |
| Other Punctuation | 442 | < 0.1% |
| Open Punctuation | 156 | < 0.1% |
| Close Punctuation | 156 | < 0.1% |
| Dash Punctuation | 95 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 166311 | |
| e | 163207 | |
| a | 144224 | |
| d | 103145 | |
| s | 101363 | |
| i | 80825 | |
| r | 65408 | 5.9% |
| t | 61310 | 5.5% |
| l | 57995 | 5.2% |
| o | 54929 | 4.9% |
| Other values (21) | 113302 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 47390 | |
| I | 46184 | |
| S | 11097 | 7.7% |
| C | 5174 | 3.6% |
| A | 4338 | 3.0% |
| T | 3736 | 2.6% |
| G | 3519 | 2.5% |
| U | 3457 | 2.4% |
| B | 2858 | 2.0% |
| K | 2492 | 1.7% |
| Other values (15) | 13029 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 432 | |
| . | 8 | 1.8% |
| ' | 2 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 12293 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 156 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 156 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 95 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1255293 | |
| Common | 13142 | 1.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 166311 | |
| e | 163207 | |
| a | 144224 | |
| d | 103145 | |
| s | 101363 | 8.1% |
| i | 80825 | 6.4% |
| r | 65408 | 5.2% |
| t | 61310 | 4.9% |
| l | 57995 | 4.6% |
| o | 54929 | 4.4% |
| Other values (46) | 256576 |
Common
| Value | Count | Frequency (%) |
| 12293 | ||
| , | 432 | 3.3% |
| ( | 156 | 1.2% |
| ) | 156 | 1.2% |
| - | 95 | 0.7% |
| . | 8 | 0.1% |
| ' | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1267470 | |
| None | 965 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 166311 | |
| e | 163207 | |
| a | 144224 | |
| d | 103145 | 8.1% |
| s | 101363 | 8.0% |
| i | 80825 | 6.4% |
| r | 65408 | 5.2% |
| t | 61310 | 4.8% |
| l | 57995 | 4.6% |
| o | 54929 | 4.3% |
| Other values (47) | 268753 |
None
| Value | Count | Frequency (%) |
| ç | 470 | |
| é | 450 | |
| í | 21 | 2.2% |
| ã | 21 | 2.2% |
| ô | 2 | 0.2% |
| Å | 1 | 0.1% |
level1Gid
Text
Missing 
| Distinct | 1487 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 159606 |
| Missing (%) | 54.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.463638302 |
| Min length | 6 |
Unique
| Unique | 311 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | NLD.14_1 |
|---|---|
| 2nd row | AUS.8_1 |
| 3rd row | GMB.4_1 |
| 4th row | NZL.12_1 |
| 5th row | MDG.2_1 |
| Value | Count | Frequency (%) |
| idn.9_1 | 14953 | 11.4% |
| nld.14_1 | 10559 | 8.0% |
| nld.9_1 | 9828 | 7.5% |
| nld.3_1 | 4946 | 3.8% |
| nld.4_1 | 4599 | 3.5% |
| idn.32_1 | 4239 | 3.2% |
| nld.11_1 | 3806 | 2.9% |
| nld.8_1 | 3327 | 2.5% |
| idn.21_1 | 2715 | 2.1% |
| idn.19_1 | 2571 | 2.0% |
| Other values (1477) | 69749 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 182381 | |
| _ | 131288 | |
| . | 130974 | |
| N | 97543 | |
| D | 94303 | |
| L | 52020 | 5.3% |
| I | 46874 | 4.8% |
| 9 | 32947 | 3.4% |
| 2 | 29421 | 3.0% |
| 4 | 21865 | 2.2% |
| Other values (28) | 160300 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 393406 | |
| Decimal Number | 324248 | |
| Connector Punctuation | 131288 | 13.4% |
| Other Punctuation | 130974 | 13.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97543 | |
| D | 94303 | |
| L | 52020 | |
| I | 46874 | |
| R | 12803 | 3.3% |
| U | 12027 | 3.1% |
| S | 11947 | 3.0% |
| A | 11663 | 3.0% |
| E | 7425 | 1.9% |
| G | 5986 | 1.5% |
| Other values (16) | 40815 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 182381 | |
| 9 | 32947 | 10.2% |
| 2 | 29421 | 9.1% |
| 4 | 21865 | 6.7% |
| 3 | 20675 | 6.4% |
| 0 | 8105 | 2.5% |
| 8 | 8018 | 2.5% |
| 5 | 7788 | 2.4% |
| 7 | 6598 | 2.0% |
| 6 | 6450 | 2.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 131288 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 130974 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 586510 | |
| Latin | 393406 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 97543 | |
| D | 94303 | |
| L | 52020 | |
| I | 46874 | |
| R | 12803 | 3.3% |
| U | 12027 | 3.1% |
| S | 11947 | 3.0% |
| A | 11663 | 3.0% |
| E | 7425 | 1.9% |
| G | 5986 | 1.5% |
| Other values (16) | 40815 |
Common
| Value | Count | Frequency (%) |
| 1 | 182381 | |
| _ | 131288 | |
| . | 130974 | |
| 9 | 32947 | 5.6% |
| 2 | 29421 | 5.0% |
| 4 | 21865 | 3.7% |
| 3 | 20675 | 3.5% |
| 0 | 8105 | 1.4% |
| 8 | 8018 | 1.4% |
| 5 | 7788 | 1.3% |
| Other values (2) | 13048 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 979916 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 182381 | |
| _ | 131288 | |
| . | 130974 | |
| N | 97543 | |
| D | 94303 | |
| L | 52020 | 5.3% |
| I | 46874 | 4.8% |
| 9 | 32947 | 3.4% |
| 2 | 29421 | 3.0% |
| 4 | 21865 | 2.2% |
| Other values (28) | 160300 |
level1Name
Text
Missing 
| Distinct | 1461 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 159606 |
| Missing (%) | 54.9% |
| Memory size | 2.2 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 10.45431557 |
| Min length | 3 |
Unique
| Unique | 304 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Zuid-Holland |
|---|---|
| 2nd row | South Australia |
| 3rd row | North Bank |
| 4th row | Otago |
| 5th row | Antsiranana |
| Value | Count | Frequency (%) |
| barat | 18889 | 10.3% |
| jawa | 16803 | 9.2% |
| zuid-holland | 10365 | 5.7% |
| noord-holland | 9828 | 5.4% |
| utara | 9113 | 5.0% |
| sumatera | 6314 | 3.5% |
| fryslân | 4946 | 2.7% |
| maluku | 4793 | 2.6% |
| gelderland | 4599 | 2.5% |
| timur | 4042 | 2.2% |
| Other values (1597) | 93030 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 231846 | |
| r | 105670 | 7.7% |
| l | 90239 | 6.6% |
| n | 84908 | 6.2% |
| e | 74291 | 5.4% |
| o | 73856 | 5.4% |
| d | 66662 | 4.9% |
| t | 65823 | 4.8% |
| u | 54960 | 4.0% |
| 51430 | 3.7% | |
| Other values (103) | 472883 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1084612 | |
| Uppercase Letter | 208552 | 15.2% |
| Space Separator | 51430 | 3.7% |
| Dash Punctuation | 26174 | 1.9% |
| Other Punctuation | 1768 | 0.1% |
| Close Punctuation | 12 | < 0.1% |
| Open Punctuation | 12 | < 0.1% |
| Modifier Symbol | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 231846 | |
| r | 105670 | |
| l | 90239 | 8.3% |
| n | 84908 | 7.8% |
| e | 74291 | 6.8% |
| o | 73856 | 6.8% |
| d | 66662 | 6.1% |
| t | 65823 | 6.1% |
| u | 54960 | 5.1% |
| i | 49062 | 4.5% |
| Other values (60) | 187295 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 27853 | |
| H | 21609 | |
| N | 19857 | |
| J | 19737 | |
| S | 16349 | 7.8% |
| U | 13543 | 6.5% |
| Z | 13026 | 6.2% |
| T | 11771 | 5.6% |
| M | 8804 | 4.2% |
| G | 7769 | 3.7% |
| Other values (23) | 48234 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1179 | |
| ' | 383 | 21.7% |
| / | 159 | 9.0% |
| , | 33 | 1.9% |
| ! | 14 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 51430 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26174 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1293164 | |
| Common | 79404 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 231846 | |
| r | 105670 | 8.2% |
| l | 90239 | 7.0% |
| n | 84908 | 6.6% |
| e | 74291 | 5.7% |
| o | 73856 | 5.7% |
| d | 66662 | 5.2% |
| t | 65823 | 5.1% |
| u | 54960 | 4.3% |
| i | 49062 | 3.8% |
| Other values (93) | 395847 |
Common
| Value | Count | Frequency (%) |
| 51430 | ||
| - | 26174 | |
| . | 1179 | 1.5% |
| ' | 383 | 0.5% |
| / | 159 | 0.2% |
| , | 33 | < 0.1% |
| ! | 14 | < 0.1% |
| ) | 12 | < 0.1% |
| ( | 12 | < 0.1% |
| ` | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1361854 | |
| None | 10700 | 0.8% |
| Latin Ext Additional | 14 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 231846 | |
| r | 105670 | 7.8% |
| l | 90239 | 6.6% |
| n | 84908 | 6.2% |
| e | 74291 | 5.5% |
| o | 73856 | 5.4% |
| d | 66662 | 4.9% |
| t | 65823 | 4.8% |
| u | 54960 | 4.0% |
| 51430 | 3.8% | |
| Other values (52) | 462169 |
None
| Value | Count | Frequency (%) |
| â | 4986 | |
| á | 1003 | 9.4% |
| í | 907 | 8.5% |
| é | 873 | 8.2% |
| ó | 401 | 3.7% |
| ð | 351 | 3.3% |
| ä | 236 | 2.2% |
| ö | 232 | 2.2% |
| ã | 195 | 1.8% |
| ý | 169 | 1.6% |
| Other values (36) | 1347 | 12.6% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 8 | |
| ầ | 3 | 21.4% |
| ừ | 1 | 7.1% |
| ế | 1 | 7.1% |
| ả | 1 | 7.1% |
level2Gid
Text
Missing 
| Distinct | 4380 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 161386 |
| Missing (%) | 55.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 9.976110322 |
| Min length | 7 |
Unique
| Unique | 1433 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | NLD.14.43_1 |
|---|---|
| 2nd row | AUS.8.23_1 |
| 3rd row | GMB.4.1_1 |
| 4th row | NZL.12.1_1 |
| 5th row | MDG.2.1_1 |
| Value | Count | Frequency (%) |
| idn.9.5_1 | 4808 | 3.7% |
| idn.9.24_1 | 2640 | 2.0% |
| idn.9.16_1 | 2196 | 1.7% |
| nld.14.2_1 | 1727 | 1.3% |
| idn.9.7_1 | 1658 | 1.3% |
| idn.32.4_1 | 1460 | 1.1% |
| idn.32.15_1 | 1399 | 1.1% |
| nld.9.4_1 | 1326 | 1.0% |
| nld.6.1_1 | 1283 | 1.0% |
| idn.9.4_1 | 1268 | 1.0% |
| Other values (4370) | 109747 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 258702 | |
| 1 | 210783 | |
| _ | 129512 | |
| N | 97521 | 7.5% |
| D | 94264 | 7.3% |
| 2 | 64997 | 5.0% |
| L | 51564 | 4.0% |
| I | 46832 | 3.6% |
| 4 | 45478 | 3.5% |
| 9 | 45448 | 3.5% |
| Other values (28) | 246925 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 515758 | |
| Uppercase Letter | 388054 | |
| Other Punctuation | 258702 | |
| Connector Punctuation | 129512 | 10.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97521 | |
| D | 94264 | |
| L | 51564 | |
| I | 46832 | |
| R | 12368 | 3.2% |
| U | 11995 | 3.1% |
| A | 11590 | 3.0% |
| S | 11200 | 2.9% |
| E | 6988 | 1.8% |
| G | 5568 | 1.4% |
| Other values (16) | 38164 | 9.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 210783 | |
| 2 | 64997 | 12.6% |
| 4 | 45478 | 8.8% |
| 9 | 45448 | 8.8% |
| 3 | 43448 | 8.4% |
| 5 | 28694 | 5.6% |
| 6 | 23220 | 4.5% |
| 8 | 20113 | 3.9% |
| 7 | 17487 | 3.4% |
| 0 | 16090 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 258702 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 129512 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 903972 | |
| Latin | 388054 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 97521 | |
| D | 94264 | |
| L | 51564 | |
| I | 46832 | |
| R | 12368 | 3.2% |
| U | 11995 | 3.1% |
| A | 11590 | 3.0% |
| S | 11200 | 2.9% |
| E | 6988 | 1.8% |
| G | 5568 | 1.4% |
| Other values (16) | 38164 | 9.8% |
Common
| Value | Count | Frequency (%) |
| . | 258702 | |
| 1 | 210783 | |
| _ | 129512 | |
| 2 | 64997 | 7.2% |
| 4 | 45478 | 5.0% |
| 9 | 45448 | 5.0% |
| 3 | 43448 | 4.8% |
| 5 | 28694 | 3.2% |
| 6 | 23220 | 2.6% |
| 8 | 20113 | 2.2% |
| Other values (2) | 33577 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1292026 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 258702 | |
| 1 | 210783 | |
| _ | 129512 | |
| N | 97521 | 7.5% |
| D | 94264 | 7.3% |
| 2 | 64997 | 5.0% |
| L | 51564 | 4.0% |
| I | 46832 | 3.6% |
| 4 | 45478 | 3.5% |
| 9 | 45448 | 3.5% |
| Other values (28) | 246925 |
level2Name
Text
Missing 
| Distinct | 4256 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 161392 |
| Missing (%) | 55.5% |
| Memory size | 2.2 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 9.585540438 |
| Min length | 2 |
Unique
| Unique | 1354 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | Lisse |
|---|---|
| 2nd row | Kangaroo Island |
| 3rd row | Central Baddibu |
| 4th row | Central Otago |
| 5th row | Diana |
| Value | Count | Frequency (%) |
| bogor | 7004 | 4.1% |
| kota | 4404 | 2.6% |
| sukabumi | 2674 | 1.6% |
| manggarai | 2078 | 1.2% |
| de | 1744 | 1.0% |
| s-gravenhage | 1727 | 1.0% |
| serdang | 1718 | 1.0% |
| barat | 1713 | 1.0% |
| cianjur | 1658 | 1.0% |
| tengah | 1627 | 1.0% |
| Other values (4584) | 143265 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 152288 | 12.3% |
| e | 125630 | 10.1% |
| n | 86546 | 7.0% |
| r | 83679 | 6.7% |
| o | 71740 | 5.8% |
| i | 59435 | 4.8% |
| t | 50086 | 4.0% |
| l | 47691 | 3.8% |
| u | 47166 | 3.8% |
| g | 43531 | 3.5% |
| Other values (134) | 473593 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1016711 | |
| Uppercase Letter | 171920 | 13.8% |
| Space Separator | 40106 | 3.2% |
| Dash Punctuation | 7908 | 0.6% |
| Other Punctuation | 3873 | 0.3% |
| Decimal Number | 366 | < 0.1% |
| Open Punctuation | 240 | < 0.1% |
| Close Punctuation | 236 | < 0.1% |
| Modifier Symbol | 24 | < 0.1% |
| Initial Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 152288 | |
| e | 125630 | |
| n | 86546 | 8.5% |
| r | 83679 | 8.2% |
| o | 71740 | 7.1% |
| i | 59435 | 5.8% |
| t | 50086 | 4.9% |
| l | 47691 | 4.7% |
| u | 47166 | 4.6% |
| g | 43531 | 4.3% |
| Other values (69) | 248919 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 22354 | 13.0% |
| S | 16829 | 9.8% |
| M | 14214 | 8.3% |
| K | 12909 | 7.5% |
| T | 10498 | 6.1% |
| H | 8804 | 5.1% |
| C | 7669 | 4.5% |
| A | 7609 | 4.4% |
| D | 7480 | 4.4% |
| L | 7475 | 4.3% |
| Other values (33) | 56079 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 138 | |
| 0 | 63 | |
| 7 | 39 | 10.7% |
| 6 | 36 | 9.8% |
| 2 | 27 | 7.4% |
| 3 | 25 | 6.8% |
| 4 | 20 | 5.5% |
| 5 | 13 | 3.6% |
| 9 | 3 | 0.8% |
| 8 | 2 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2148 | |
| . | 1637 | |
| , | 60 | 1.5% |
| / | 24 | 0.6% |
| & | 3 | 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 40106 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7908 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 240 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 236 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 24 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‹ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1188631 | |
| Common | 52754 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 152288 | 12.8% |
| e | 125630 | 10.6% |
| n | 86546 | 7.3% |
| r | 83679 | 7.0% |
| o | 71740 | 6.0% |
| i | 59435 | 5.0% |
| t | 50086 | 4.2% |
| l | 47691 | 4.0% |
| u | 47166 | 4.0% |
| g | 43531 | 3.7% |
| Other values (112) | 420839 |
Common
| Value | Count | Frequency (%) |
| 40106 | ||
| - | 7908 | 15.0% |
| ' | 2148 | 4.1% |
| . | 1637 | 3.1% |
| ( | 240 | 0.5% |
| ) | 236 | 0.4% |
| 1 | 138 | 0.3% |
| 0 | 63 | 0.1% |
| , | 60 | 0.1% |
| 7 | 39 | 0.1% |
| Other values (12) | 179 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1234475 | |
| None | 6826 | 0.5% |
| IPA Ext | 58 | < 0.1% |
| Latin Ext Additional | 25 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 152288 | 12.3% |
| e | 125630 | 10.2% |
| n | 86546 | 7.0% |
| r | 83679 | 6.8% |
| o | 71740 | 5.8% |
| i | 59435 | 4.8% |
| t | 50086 | 4.1% |
| l | 47691 | 3.9% |
| u | 47166 | 3.8% |
| g | 43531 | 3.5% |
| Other values (63) | 466683 |
None
| Value | Count | Frequency (%) |
| â | 1341 | |
| á | 1059 | |
| ú | 785 | |
| é | 654 | |
| ó | 431 | 6.3% |
| í | 407 | 6.0% |
| ð | 278 | 4.1% |
| ö | 253 | 3.7% |
| è | 218 | 3.2% |
| ä | 182 | 2.7% |
| Other values (54) | 1218 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 58 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ụ | 13 | |
| ạ | 5 | 20.0% |
| ử | 3 | 12.0% |
| ả | 3 | 12.0% |
| ộ | 1 | 4.0% |
Punctuation
| Value | Count | Frequency (%) |
| ‹ | 1 |
level3Gid
Text
Missing 
| Distinct | 3681 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 227914 |
| Missing (%) | 78.3% |
| Memory size | 2.2 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 12.13957513 |
| Min length | 9 |
Unique
| Unique | 1291 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | MDG.2.1.5_1 |
|---|---|
| 2nd row | BEL.2.1.3_1 |
| 3rd row | IDN.18.1.5_1 |
| 4th row | IDN.19.9.2_1 |
| 5th row | IDN.19.6.1_1 |
| Value | Count | Frequency (%) |
| idn.9.5.3_1 | 2876 | 4.6% |
| idn.9.4.13_1 | 1247 | 2.0% |
| idn.9.7.13_1 | 1190 | 1.9% |
| idn.21.9.5_1 | 848 | 1.3% |
| idn.9.16.5_1 | 799 | 1.3% |
| idn.9.16.3_1 | 763 | 1.2% |
| idn.29.9.7_1 | 672 | 1.1% |
| idn.19.6.5_1 | 644 | 1.0% |
| idn.9.16.1_1 | 612 | 1.0% |
| idn.9.24.5_1 | 548 | 0.9% |
| Other values (3671) | 52785 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 188948 | |
| 1 | 127130 | |
| _ | 62984 | 8.2% |
| N | 47817 | 6.3% |
| D | 47033 | 6.2% |
| I | 45947 | 6.0% |
| 2 | 43506 | 5.7% |
| 3 | 33491 | 4.4% |
| 9 | 30877 | 4.0% |
| 5 | 21788 | 2.8% |
| Other values (26) | 115078 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 324197 | |
| Other Punctuation | 188948 | |
| Uppercase Letter | 188470 | |
| Connector Punctuation | 62984 | 8.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 47817 | |
| D | 47033 | |
| I | 45947 | |
| R | 5081 | 2.7% |
| E | 5015 | 2.7% |
| A | 4620 | 2.5% |
| U | 3451 | 1.8% |
| B | 3258 | 1.7% |
| C | 3078 | 1.6% |
| L | 3032 | 1.6% |
| Other values (14) | 20138 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 127130 | |
| 2 | 43506 | 13.4% |
| 3 | 33491 | 10.3% |
| 9 | 30877 | 9.5% |
| 5 | 21788 | 6.7% |
| 4 | 20943 | 6.5% |
| 6 | 14539 | 4.5% |
| 7 | 11698 | 3.6% |
| 8 | 11264 | 3.5% |
| 0 | 8961 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 188948 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 62984 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 576129 | |
| Latin | 188470 | 24.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 47817 | |
| D | 47033 | |
| I | 45947 | |
| R | 5081 | 2.7% |
| E | 5015 | 2.7% |
| A | 4620 | 2.5% |
| U | 3451 | 1.8% |
| B | 3258 | 1.7% |
| C | 3078 | 1.6% |
| L | 3032 | 1.6% |
| Other values (14) | 20138 |
Common
| Value | Count | Frequency (%) |
| . | 188948 | |
| 1 | 127130 | |
| _ | 62984 | 10.9% |
| 2 | 43506 | 7.6% |
| 3 | 33491 | 5.8% |
| 9 | 30877 | 5.4% |
| 5 | 21788 | 3.8% |
| 4 | 20943 | 3.6% |
| 6 | 14539 | 2.5% |
| 7 | 11698 | 2.0% |
| Other values (2) | 20225 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 764599 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 188948 | |
| 1 | 127130 | |
| _ | 62984 | 8.2% |
| N | 47817 | 6.3% |
| D | 47033 | 6.2% |
| I | 45947 | 6.0% |
| 2 | 43506 | 5.7% |
| 3 | 33491 | 4.4% |
| 9 | 30877 | 4.0% |
| 5 | 21788 | 2.8% |
| Other values (26) | 115078 |
level3Name
Text
Missing 
| Distinct | 3486 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 229332 |
| Missing (%) | 78.8% |
| Memory size | 2.2 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 9.412143066 |
| Min length | 2 |
Unique
| Unique | 1204 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Nosibe |
|---|---|
| 2nd row | Turnhout |
| 3rd row | Jailolo |
| 4th row | Kairatu |
| 5th row | Amahai |
| Value | Count | Frequency (%) |
| caringin | 3007 | 3.4% |
| barat | 2340 | 2.7% |
| bogor | 2175 | 2.5% |
| utara | 1682 | 1.9% |
| tengah | 1483 | 1.7% |
| muara | 1352 | 1.5% |
| selatan | 1306 | 1.5% |
| gembong | 1247 | 1.4% |
| n.a | 1246 | 1.4% |
| cipanas | 1191 | 1.4% |
| Other values (3808) | 70921 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 90654 | |
| n | 47306 | 8.2% |
| i | 40003 | 6.9% |
| r | 33679 | 5.8% |
| e | 31527 | 5.4% |
| o | 30937 | 5.3% |
| u | 29442 | 5.1% |
| 26384 | 4.6% | |
| g | 26350 | 4.5% |
| t | 19974 | 3.4% |
| Other values (100) | 203212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 455743 | |
| Uppercase Letter | 85691 | 14.8% |
| Space Separator | 26384 | 4.6% |
| Decimal Number | 3871 | 0.7% |
| Other Punctuation | 3280 | 0.6% |
| Dash Punctuation | 1588 | 0.3% |
| Open Punctuation | 1493 | 0.3% |
| Close Punctuation | 1418 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 90654 | |
| n | 47306 | |
| i | 40003 | |
| r | 33679 | 7.4% |
| e | 31527 | 6.9% |
| o | 30937 | 6.8% |
| u | 29442 | 6.5% |
| g | 26350 | 5.8% |
| t | 19974 | 4.4% |
| l | 17227 | 3.8% |
| Other values (47) | 88644 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 10208 | |
| B | 10163 | |
| C | 9261 | |
| T | 7994 | |
| P | 6383 | 7.4% |
| M | 5977 | 7.0% |
| K | 5833 | 6.8% |
| L | 4110 | 4.8% |
| G | 3699 | 4.3% |
| A | 3157 | 3.7% |
| Other values (22) | 18906 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 869 | |
| 2 | 826 | |
| 3 | 416 | |
| 8 | 396 | |
| 0 | 365 | |
| 4 | 321 | 8.3% |
| 9 | 254 | 6.6% |
| 5 | 165 | 4.3% |
| 7 | 165 | 4.3% |
| 6 | 94 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2918 | |
| , | 127 | 3.9% |
| ' | 115 | 3.5% |
| / | 115 | 3.5% |
| : | 2 | 0.1% |
| * | 2 | 0.1% |
| ! | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 26384 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1588 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1493 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 541434 | |
| Common | 38034 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 90654 | |
| n | 47306 | 8.7% |
| i | 40003 | 7.4% |
| r | 33679 | 6.2% |
| e | 31527 | 5.8% |
| o | 30937 | 5.7% |
| u | 29442 | 5.4% |
| g | 26350 | 4.9% |
| t | 19974 | 3.7% |
| l | 17227 | 3.2% |
| Other values (79) | 174335 |
Common
| Value | Count | Frequency (%) |
| 26384 | ||
| . | 2918 | 7.7% |
| - | 1588 | 4.2% |
| ( | 1493 | 3.9% |
| ) | 1418 | 3.7% |
| 1 | 869 | 2.3% |
| 2 | 826 | 2.2% |
| 3 | 416 | 1.1% |
| 8 | 396 | 1.0% |
| 0 | 365 | 1.0% |
| Other values (11) | 1361 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 578313 | |
| None | 1139 | 0.2% |
| Latin Ext Additional | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 90654 | |
| n | 47306 | 8.2% |
| i | 40003 | 6.9% |
| r | 33679 | 5.8% |
| e | 31527 | 5.5% |
| o | 30937 | 5.3% |
| u | 29442 | 5.1% |
| 26384 | 4.6% | |
| g | 26350 | 4.6% |
| t | 19974 | 3.5% |
| Other values (63) | 202057 |
None
| Value | Count | Frequency (%) |
| ü | 192 | |
| é | 165 | |
| è | 113 | |
| ó | 100 | |
| á | 97 | |
| ã | 71 | 6.2% |
| â | 71 | 6.2% |
| ö | 63 | 5.5% |
| ä | 55 | 4.8% |
| ñ | 29 | 2.5% |
| Other values (24) | 183 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ụ | 13 | |
| ạ | 2 | 12.5% |
| ộ | 1 | 6.2% |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 167789 |
| Missing (%) | 57.7% |
| Memory size | 2.2 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | LC |
| 3rd row | LC |
| 4th row | NT |
| 5th row | VU |
| Value | Count | Frequency (%) |
| lc | 91089 | |
| ne | 17845 | 14.5% |
| nt | 7674 | 6.2% |
| vu | 4076 | 3.3% |
| en | 1736 | 1.4% |
| cr | 526 | 0.4% |
| ex | 130 | 0.1% |
| dd | 25 | < 0.1% |
| ew | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 91615 | |
| L | 91089 | |
| N | 27255 | 11.1% |
| E | 19719 | 8.0% |
| T | 7674 | 3.1% |
| V | 4076 | 1.7% |
| U | 4076 | 1.7% |
| R | 526 | 0.2% |
| X | 130 | 0.1% |
| D | 50 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 246218 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 91615 | |
| L | 91089 | |
| N | 27255 | 11.1% |
| E | 19719 | 8.0% |
| T | 7674 | 3.1% |
| V | 4076 | 1.7% |
| U | 4076 | 1.7% |
| R | 526 | 0.2% |
| X | 130 | 0.1% |
| D | 50 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 246218 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 91615 | |
| L | 91089 | |
| N | 27255 | 11.1% |
| E | 19719 | 8.0% |
| T | 7674 | 3.1% |
| V | 4076 | 1.7% |
| U | 4076 | 1.7% |
| R | 526 | 0.2% |
| X | 130 | 0.1% |
| D | 50 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 246218 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 91615 | |
| L | 91089 | |
| N | 27255 | 11.1% |
| E | 19719 | 8.0% |
| T | 7674 | 3.1% |
| V | 4076 | 1.7% |
| U | 4076 | 1.7% |
| R | 526 | 0.2% |
| X | 130 | 0.1% |
| D | 50 | < 0.1% |